Human calcium channel α1, α2, and β subunits and assays using them

ABSTRACT

Isolated DNA encoding each of human calcium channel α 1  -, α 2  -, β- and γ-subunits, including subunits that arise as splice variants of primary transcripts, is provided. Cells and vectors containing the DNA and methods for identifying compounds that modulate the activity of human calcium channels are also provided.

This application is the national stage (35 U.S.C. §371) of International Application PCT/US92/6903, filed Aug. 14, 1992, which is a continuation-in-part of U.S. Ser. No. 07/868,354, filed Apr. 10, 1992, now abandoned, and is also a continuation-in-part of U.S. Ser. No. 07/745,206, filed Aug. 15, 1991, now U.S. Pat. No. 5,429,921.

U.S. Ser. No. 07/868,354 is a continuation-in-part of U.S. Ser. No. 07/745,206, which is a continuation-in-part of U.S. Ser. No. 07/620,250, filed Nov. 30, 1990, which is a continuation-in-part of U.S. Ser. No. 07/176,899, filed Apr. 4, 1988, now abandoned. U.S. Ser. No. 07/868,354 and 07/745,206 are also a continuation-in-part of U.S. Ser. No. 07/482,384, filed Feb. 20, 1990, and are also a continuation-in-part of 07/603,751, filed Apr. 4, 1989, now abandoned.

TECHNICAL FIELD

The present invention relates to molecular biology and pharmacology. More particularly, the invention relates to calcium channel compositions and methods of making and using the same.

BACKGROUND OF THE INVENTION

Calcium channels are membrane-spanning, multi-subunit proteins that allow controlled entry of Ca²⁺ ions into cells from the extracellular fluid. Cells throughout the animal kingdom, and at least some bacterial, fungal and plant cells, possess one or more types of calcium channel.

The most common type of calcium channel is voltage dependent. "Opening" of a voltage-dependent channel to allow an influx of Ca²⁺ ions into the cells requires a depolarization to a certain level of the potential difference between the inside of the cell bearing the channel and the extracellular medium bathing the cell. The rate of influx of Ca²⁺ into the cell depends on this potential difference. All "excitable" cells in animals, such as neurons of the central nervous system (CNS), peripheral nerve cells and muscle cells, including those of skeletal muscles, cardiac muscles, and venous and arterial smooth muscles, have voltage-dependent calcium channels.

Multiple types of calcium channels have been identified in mammalian cells from various tissues, including skeletal muscle, cardiac muscle, lung, smooth muscle and brain, see, e.g., Bean, B. P.(1989) Ann. Rev. Physiol. 51:367-384 and Hess, P. (1990) Ann. Rev. Neurosci. 56:337!. The different types of calcium channels have been broadly categorized into four classes, L-, T-, N-, and P-type, distinguished by current kinetics, holding potential sensitivity and sensitivity to calcium channel agonists and antagonists.

Calcium channels are multisubunit proteins. For example, rabbit skeletal muscle calcium channel contains two large subunits, designated α₁ and α₂, which have molecular weights between about 130 and about 200 kilodaltons ("kD"), and one to three different smaller subunits of less than about 60 kD in molecular weight. At least one of the larger subunits and possibly some of the smaller subunits are glycosylated. Some of the subunits are capable of being phosphorylated. The α₁ subunit has a molecular weight of about 150 to about 170 kD when analyzed by sodium dodecylsulfate (SDS)-polyacrylamide gel electrophoresis (PAGE) after isolation from mammalian muscle tissue and has specific binding sites for various 1,4-dihydropyridines (DHPs) and phenylalkylamines. Under non-reducing conditions (in the presence of N-ethylmaleimide), the α₂ subunit migrates in SDS-PAGE as a band corresponding to a molecular weight-of about 160-190 kD. Upon reduction, a large fragment and smaller fragments are released. The β subunit of the rabbit skeletal muscle calcium channel is a phosphorylated protein that has a molecular weight of 52-65 kD as determined by SDS-PAGE analysis. This subunit is insensitive to reducing conditions. The γ subunit of the calcium channel, which is not observed in all purified preparations, appears to be a glycoprotein with an apparent molecular weight of 30-33 kD, as determined by SDS-PAGE analysis.

In order to study calcium channel structure and function, large amounts of pure channel protein are needed. Because of the complex nature of these multisubunit proteins, the varying concentrations of calcium channels in tissue sources of the protein, the presence of mixed populations of calcium channels in tissues, difficulties in obtaining tissues of interest, and the modifications of the native protein that can occur during the isolation procedure, it is extremely difficult to obtain large amounts of highly purified, completely intact calcium channel protein.

Characterization of a particular type of calcium channel by analysis of whole cells is severely restricted by the presence of mixed populations of different types of calcium channels in the majority of cells. Single-channel recording methods that are used to examine individual calcium channels do not reveal any information regarding the molecular structure or biochemical composition of the channel. Furthermore, in performing this type of analysis, the channel is isolated from other cellular constituents that might be important for natural functions and pharmacological interactions.

Characterization of the gene or genes encoding calcium channels provides another means of characterization of different types of calcium channels. The amino acid sequence determined from a complete nucleotide sequence of the coding region of a gene encoding a calcium channel protein represents the primary structure of the protein. Furthermore, secondary structure of the calcium channel protein and the relationship of the protein to the membrane may be predicted based on analysis of the primary structure. For instance, hydropathy plots of the α₁ subunit protein of the rabbit skeletal muscle calcium channel indicate that it contains four internal repeats, each containing six putative transmembrane regions Tanabe, T. et al. (1987) Nature 328:313!.

The cDNA and corresponding amino acid sequences of the α₁, α₂, β and γ subunits of the rabbit skeletal muscle calcium channel see, Tanabe et al. (1987) Nature 328:313-318; International Application No. WO 89/09834, which is U.S. application Ser. No. 07/603,751, now abandoned, which is a continuation-in-part of U.S. application Ser. No. 07/176,899 also now abandoned; Ruth et al. (1989) Science 245:1115-1118; and U.S. patent application Ser. No. 482,384, filed Feb. 20, 1990! have been determined. The cDNA and corresponding amino acid sequences of α₁ subunits of rabbit cardiac muscle Mikami, A. et al. (1989) Nature 340:230-233! and lung (Biel, M. (1990) FEBS Letters 269:409-412) calcium channels have been determined.

In addition, a cDNA clone encoding a rabbit brain calcium channel (designated the BI channel) has been isolated Mori, Y. et al. (1991) Nature 350:398-402!. Partial cDNA clones encoding portions of several different subtypes, referred to as rat brain class A, B, C and D, of the calcium channel α₁ subunit have been isolated from rat brain cDNA libraries Snutch, T. et al. (1990) Proc. Natl. Acad. Sci. USA 87:3391-3395!. More recently full-length rat brain class, A Starr, T. et al. (1991) Proc. Natl. Acad. Sci. USA 88:5621-5625! and class C Snutch, T. et al. (1991) Neuron 7:45-57! cDNA clones have been isolated. Although the amino acid sequence encoded by the rat brain class C DNA is approximately 95% identical to that encoded by the rabbit cardiac muscle calcium channel α₁ subunit-encoding DNA, the amino acid sequence encoded by the rat brain class A DNA shares only 33% sequence identity with the amino acid sequence encoded by the rabbit skeletal or cardiac muscle α₁ subunit-encoding DNA. A cDNA clone encoding another rat brain calcium channel α₁ subunit has also been obtained Hui, A. et al. (1991) Neuron 7:35-44). The amino acid sequence encoded by this clone is -70% homologous to the proteins encoded by the rabbit skeletal and cardiac muscle calcium channel DNA. A cDNA clone closely related to the rat brain class C α₁ subunit-encoding cDNA and sequences of partial cDNA clones closely related to other partial cDNA clones encoding-apparently different calcium channel α₁ subunits have also been isolated see Snutch, T. et al. (1991) Neuron 1:45-57; Perez-Reyes, E. et al. (1990) J. Biol. Chem. 265:20430; and Hui, A. et al. (1991) Neuron 7:35-44!. DNA clones encoding other calcium channels have also been identified and isolated.

Expression of cDNA encoding calcium channel subunits has been achieved with several of the different rabbit or rat α₁ subunit cDNA clones discussed above. Voltage-dependent calcium currents have been detected in murine L cells transfected with DNA encoding the rabbit skeletal muscle calcium channel α₁ subunit Perez-Reyes et al. (1989) Nature 340:233-236 (1989)!. These currents were enhanced in the presence of the calcium channel agonist Bay K 8644. Bay K 8644-sensitive Ba²⁺ currents have been detected in oocytes injected with in vitro transcripts of the rabbit cardiac muscle calcium channel α₁ subunit cDNA Mikami, A. et al. (1989) Nature 340:230-233!. These currents were substantially reduced in the presence of the calcium channel antagonist nifedipine. Barium currents of an oocyte co-injected with RNA encoding the rabbit cardiac muscle calcium channel α₁ subunit and the RNA encoding the rabbit skeletal muscle calcium channel α₂ subunit were more than 2-fold larger than those of oocytes injected with transcripts of the rabbit cardiac calcium channel α₁ subunit-encoding cDNA. Similar results were obtained when oocytes were co-injected with RNA encoding the rabbit lung calcium channel α₁ subunit and the rabbit skeletal muscle calcium channel α₂ subunit. The barium current was greater than that detected in oocytes injected only with RNA encoding the rabbit lung calcium channel α₁ subunit Biel, M. et al. (1990) FEBS Letters 269:409-412!. Inward barium currents have been detected in oocytes injected with in vitro RNA transcripts encoding the rabbit brain BI channel Mori et al. (1991) Nature 350:398-402!. These currents were increased by two orders of magnitude when In vitro transcripts of the rabbit skeletal muscle calcium channel α₂, β, or α₂, β and γ subunits were co-injected with transcripts of the BI-encoding cDNA. Barium currents in oocytes co-injected with transcripts encoding the BI channel and the rabbit skeletal muscle calcium channel α₂ and β were unaffected by the calcium channel antagonists nifedipine or ω-CgTx and inhibited by Bay K 8644 and crude venom from Agelenopsis aperta.

The results of studies of recombinant expression of rabbit calcium channel α₁ subunit-encoding cDNA clones and transcripts of the cDNA clones indicate that the α₁ subunit forms the pore through which calcium enters cells. The relevance of the barium currents generated, in these recombinant cells to the actual current generated by calcium channels containing as one component the respective α₁ subunits in vivo is unclear. In order to completely and accurately characterize and evaluate different calcium channel types, however, it is essential to examine the functional properties of recombinant channels containing all of the subunits as found in vivo. Although there has been limited success in expressing DNA encoding rabbit and rat calcium channel subunits, far less has been achieved with respect to human calcium channels. Little is known about human calcium channel structure and function and gene expression. An understanding of the structure and function of human calcium channels would permit identification of substances that, in some manner, modulate the activity of calcium channels and that have potential for use in treating such disorders.

Because calcium channels are present in various tissues and have a central role in regulating intracellular calcium ion concentrations, they are implicated in a number of vital processes in animals, including neurotransmitter release, muscle contraction, pacemaker activity, and secretion of hormones and other substances. These processes appear to be involved in numerous human disorders, such as CNS and cardiovascular diseases. Calcium channels, thus, are also implicated in numerous disorders. A number of compounds useful for treating various cardiovascular diseases in animals, including humans, are thought to exert their beneficial effects by modulating functions of voltage-dependent calcium channels present in cardiac and/or vascular smooth muscle. Many of these compounds bind to calcium channels and block, or reduce the rate of, influx of Ca²⁺ into the cells in response to depolarization of the call membrane.

An understanding of the pharmacology of compounds that interact with calcium channels in other organ systems, such as the CNS, may aid in the rational design of compounds that specifically interact with subtypes of human calcium channels to have desired therapeutic effects, such as in the treatment of neurodegenerative and cardiovascular disorders. Such understanding and the ability to rationally design therapeutically effective compounds, however, have been hampered by an inability to independently determine the types of human calcium channels and the molecular nature of individual subtypes, particularly in the CNS, and by the unavailability of pure preparations of specific channel subtypes to use for evaluation of the specificity of calcium channel-effecting compounds. Thus, identification of DNA encoding human calcium channel subunits and the use of such DNA for expression of calcium channel subunits and functional calcium channels would aid in screening and designing therapeutically effective compounds.

Therefore, it is an object herein, to provide DNA encoding specific calcium channel subunits and to provide eukaryotic cells bearing recombinant tissue-specific or subtype- specific calcium channels. It is also an object to provide assays for identification of potentially therapeutic compounds that act as calcium channel antagonists and agonists.

SUMMARY OF THE INVENTION

Eukaryotic cells containing heterologous DNA encoding one or more calcium channel subunits, particularly human calcium channel subunits, or containing RNA transcripts of DNA clones encoding one or more of the subunits are provided. In preferred embodiments, the cells contain DNA or RNA encoding a human α₁ subunit, preferably, at least an α_(1D) or α_(1B) subunit. In more preferred embodiments, the cells contain DNA or RNA encoding additional heterologous subunits, including at least one β, α₂ or γ subunits are included. In such embodiments, eukaryotic cells stably or transiently transfected with any combination of one, two, three or four of the subunit-encoding cDNA clones,,such as α₁, α₁ +β, α₁ +β+α₂, are provided. In more preferred embodiments, the subunits encoded by the heterologous DNA are human subunits.

In preferred embodiments, the cells express such heterologous calcium channel subunits and include one or more of the subunits in membrane spanning heterologous calcium channels. In more preferred embodiments, the eukaryotic cells express functional, heterologous calcium channels that are capable of gating the passage of calcium channel selective ions and/or binding compounds that, at physiological concentrations, modulate the activity of the heterologous calcium channel. In certain embodiments, the heterologous calcium channels include at least one heterologous calcium channel subunit. In most preferred embodiments, the calcium channels that are expressed on the surface of the eukaryotic cells are composed substantially or entirely of subunits encoded by the heterologous DNA or RNA. In preferred embodiments, the heterologous calcium channels of such cells are distinguishable from any endogenous calcium channels of the host cell.

In certain embodiments the recombinant eukaryotic cells that contain the heterologous DNA encoding the calcium channel subunits are produced by transfection with DNA encoding one or more of the subunits or are injected with RNA transcripts of cDNA encoding one or more of the calcium channel subunits. The DNA may be introduced as a linear DNA fragment or may be included in an expression vector for stable or transient expression of the subunit-encoding DNA. Vectors containing DNA encoding human calcium channel subunits are also provided.

The eukaryotic cells that express heterologous calcium channels may be used in assays for calcium channel function or, in the case of cells transformed with fewer subunit-encoding nucleic acids than necessary to constitute a functional recombinant human calcium channel, such cells may be used to assess the effects of additional subunits on calcium channel activity. The additional subunits can be provided by subsequently transfecting such a cell with one or more DNA clones or RNA transcripts encoding human calcium channel subunits.

The recombinant eukaryotic cells that express membrane spanning heterologous calcium channels may be used in methods for identifying compounds that modulate calcium channel activity. In particular, the cells are used in assays that identify agonists and antagonists of calcium channel activity in humans and/or assessing the contribution of the various calcium channel subunits to the transport and regulation of transport of calcium ions.

Assays using the eukaryotic cells for identifying compounds that modulate calcium channel activity are provided.

Isolated and purified DNA fragments that encode human calcium channel subunits are provided. DNA encoding a, subunits of a human calcium channel, and RNA, encoding such subunits, made upon transcription of such DNA are provided. In particular, DNA fragments encoding α₁ subunits of voltage-dependent human calcium channels (VDCCs) type A, type B (also referred to as VDCC IV), type C (also referred to as VDCC II) and type D (also referred to as VDCC III) are provided.

In particular, DNA encoding an α_(1D) subunit that includes the amino acids substantially as set forth as residues 10-2161 of sequence ID No. 1 is provided. DNA encoding an α_(1D) subunit includes substantially the amino acids set forth as amino acids 1-34 in sequence ID No. 2 in place of amino acids 373-406 of SEQ ID No. 1 is also provided. DNA encoding an α_(1C) subunit that includes the amino acids substantially as set forth in sequence ID No. 3 or sequence ID No. 6 and DNA encoding an α_(1B) subunit that includes an amino acid sequence substantially as set forth in sequence ID No. 7 or in sequence. ID No. 8 is also provided. A phage lysate of an E. coli host containing DNA encoding α_(1A) have been deposited in the American Type culture Collection, 12301 Parklawn Drive, Rockville, Md. 20852 U.S.A. under Accession No. in accord with the Budapest Treaty. The DNA in such phage includes a DNA fragment having the sequence set forth in SEQ ID No. 21. This fragment hybridizes to DNA encoding α_(1A) but not to DNA encoding α_(1B).

DNA encoding α₂ subunits of a human calcium channel, and RNA encoding such subunits, made upon transcription of such a DNA are provided. DNA encoding splice variants of the α₂ subunit, including tissue specific splice variants, are also provided. In particular, DNA encoding the α_(2a) -α_(2c) subunit subtypes is provided. In particularly preferred embodiments, the DNA encoding the α₂ subunit is produced by alternative processing of a primary transcript that includes DNA encoding the amino acids set forth in SEQ ID 11 and the DNA of SEQ ID No. 13 inserted between nucleotides 1624 and 1625 of SEQ ID No. 11. The aqueous of those splice variants which are designated α_(2b), α_(2a), and α_(2c) -α_(2e) are set forth in SEQ. ID Nos. 11 and 22-25, respectively

Isolated and purified DNA fragments encoding human calcium channel β subunits, including DNA encoding β₁ subunit splice variants and the β₃ subunit is provided. In particular, DNA encoding the β₁ and β₃ subunits, including the β₁ subunit splice variants β₁₋₁ -β₁₋₅, is provided. RNA, encoding β subunits, made upon transcription of the DNA is also provided. Escherichia. coli (E. coli) containing plasmids containing DNA encoding β₃ have been deposited in accord with the Budapest Treaty under Accession No. 69048 at the American Type Culture Collection. A partial sequence of the deposited clone is set forth in SEQ ID No. 19 (sequence from the 5' end) and SEQ ID No. 20 (sequence from the 3' end).

DNA encoding β subunits that are produced by alternative processing of a primary transcript encoding a β subunit, including a transcript that includes DNA encoding the amino acids set forth in SEQ ID No. 9 or including a primary transcript that encodes β₃ as deposited under ATCC Accession No. 69048, but lacking and including alternative exons are provided or may be constructed from the DNA provided herein. For example, DNA encoding a β subunit that is produced by alternative processing of a primary transcript that includes DNA encoding the amino acids set forth in SEQ ID No. 9, but including the DNA set forth in SEQ ID No. 12 inserted in place of nucleotides 615-781 of SEQ ID No. 9 is also provided. DNA encoding β subunits that are encoded by transcripts that have the sequence set forth in SEQ ID No. 9 including the DNA set forth in SEQ ID No. 12 inserted in place of nucleotides 615-781 of SEQ ID No. 9, but that lack one or more of the following sequences of nucleotides: nucleotides 14-34 of SEQ ID No. 12, nucleotides 13-34 of SEQ ID No. 12, nucleotides 35-55 of SEQ ID No 12, nucleotides 56-190 of SEQ ID No. 12 and nucleotides 191-271 of SEQ ID No. 12 are also provided.

DNA encoding γ subunits of human calcium channels is also provided. RNA, encoding γ subunits, made upon transcription of the DNA is also provided. In particular, DNA containing the sequence of nucleotides set forth in SEQ ID No. 14 is provided.

Full-length DNA clones and corresponding RNA transcripts, encoding the α₁, including α_(1D), α_(1B), β₂ and β subunits, including β₁₋₁ -β₁₋₅, of human calcium channels are provided. The sequences of DNA encoded the β₁ splice variants β₁₋₁, β₁₋₂, β₁₋₃, β₁₋₄ and β₁₋₅ are set forth in SEQ ID Nos. 26, 9, 10, 27 and 28. Also provided are DNA clones encoding substantial portions of the α_(1A), α_(1C), β₃ and γ subunits of voltage-dependent human calcium channels for the preparation of full-length DNA clones encoding the full-length α_(1A), α_(1C), β₃ and γ subunits.

Nucleic acid probes containing at least about 14 contiguous nucleotides of α_(1D) α_(1C), α_(1B), α_(1A), α₂, β, including β₁ splice variants and β₃ and γ subunit-encoding DNA are provided. Methods using the probes for the isolation and cloning of calcium channel subunit-encoding cDNA, including splice variants within tissues and inter-tissue variants are also provided.

Purified human calcium channel subunits and purified human calcium channels are provided. The subunits and channels can be isolated from a eukaryotic cell transfected with DNA that encodes the subunit.

In another embodiment, immunoglobulins or antibodies obtained from the serum of an animal immunized with a substantially pure preparation of a human calcium channel, human calcium channel subunit or epitope-containing fragment of a human calcium subunit are provided. Monoclonal antibodies produced using a human calcium channel, human calcium channel subunit or epitope-containing fragment thereof as an immunogen are also provided. E. coli fusion proteins including a fragment of a human calcium channel subunit may also be used as immunogen. Such fusion proteins may contain a bacterial protein or portion thereof, such as the E. coli TrpE protein, fused to a calcium channel subunit peptide. The immunoglobulins that are produced using the calcium channel subunits or purified calcium channels as immunogens have, among other properties, the ability to specifically and preferentially bind to and/or cause the immunoprecipitation of a human calcium channel or a subunit thereof which may be present in a biological sample or a solution derived from such a biological sample.

A diagnostic method for determining the presence of Lambert Eaton Syndrome (LES) in a human based on immunological reactivity of LES immunoglobulin G (IgG) with a human calcium channel subunit or a eukaryotic cell which expresses a recombinant human calcium channel or a subunit thereof is also provided. In particular, an immunoassay method for diagnosing Lambert-Eaton Syndrome in a person by combining serum or an IgG fraction from the person (test serum) with calcium channel proteins, including the α and β subunits, and ascertaining whether antibodies in the test serum react with one or more of the subunits, or a recombinant cell which expresses one or more of the subunits to a greater extent than antibodies in control serum, obtained from a person or group of persons known to be free of the Syndrome, is provided. Any immunoassay procedure known in the art for detecting antibodies against a given antigen in serum can be employed in the method.

DETAILED DESCRIPTION OF THE INVENTION

Definitions

Unless defined otherwise, all technical and scientific terms used herein have the same meaning as is commonly understood by one of skill in the art to which this invention belongs. All patents and publications referred to he rein are incorporated by reference herein.

Reference to each of the calcium channel subunits includes, the subunits that are specifically disclosed herein and human calcium channel subunits encoded by DNA that can be isolated by using the DNA disclosed as probes and screening an appropriate human cDNA or genomic library under at least low stringency. Such DNA also includes DNA that encodes proteins that have about 40% homology to any of the subunits proteins described herein or DNA that hybridizes under conditions of at least low stringency to the DNA provided herein and the protein encoded by such DNA exhibits additional identifying characteristics, such as function or molecular weight.

It is understood that subunits that are encoded by transcripts that represent splice variants of the disclosed subunits or other such subunits may exhibit less than 40% overall homology to any single subunit, but will include regions of such homology to one or more such subunits. It is also understood that 40% homology refers to proteins that share approximately 40% of their amino acids in common or that share somewhat less, but include conservative amino acid substitutions, whereby the activity of the protein is not substantially altered.

As used herein, the α₁ subunits types, encoded by different genes, are designated as type α_(1A), α_(1B), α_(1C), and α_(1D), These types may also be also referred to as VDCC IV for α_(1B), VDCC II for α_(1C) and VDCC III for α_(1D). Subunit subtypes, which are splice variants, are referred to, for example as α_(1B-1), α_(1B-2), α_(1C-1) etc.

Thus, as used herein, DNA encoding the α₁ subunit refers to DNA that hybridizes to the DNA provided herein under conditions of at least low stringency or encodes a subunit that has roughly about 40% homology to protein encoded by DNA disclosed herein that encodes an α₁ subunit of a human calcium. An α₁ subunit may be identified by its ability to form a calcium channel. Typically, α₁ subunits have molecular weights greater than at least about 120 kD. The activity of a calcium channel may be assessed in vitro by methods known to those of skill in the art, including the electrophysiological and other methods described herein. Typically, α₁ subunits include regions to which one or more modulators of calcium channel activity, such as a 1,4 DHP or ω-CgTx, interact directly of indirectly. Types of α₁ subunits may be distinguished by any method known to those of skill in the art, including on the basis of binding specificity. For example, it has been found herein that α_(1B) subunits participate in the formation of N-type channels, α_(ID) subunits participate in the formation of L-type channels, and α_(IA) subunits appear to participate in the formation of channels that exhibit characteristics typical of P-type channels. Thus, for example, the activity of channels that contain the α_(1B) subunit are insensitive to 1,4 DHPs; whereas the activity of channels that contain the α_(1D) subunit are modulated or altered by a 1,4 DHP. Types and subtypes of α₁ subunits may be characterized on the basis of the effects of such modulators on the subunit or a channel containing the subunit as well as differences in currents and current kinetics produced by calcium channels containing the subunit.

As used herein, an α₂ subunit is encoded by DNA that hybridizes to the DNA provided herein under conditions of low stringency or encodes a protein that has about 40% homology with that disclosed herein. Such DNA encodes a protein that typically has a molecular weight greater than about 120 kD, but does not form a calcium channel in the absence of an α₁ subunit, and may alter the activity of a calcium channel that contains an α₁ subunit. Subtypes of the α₂ subunit that arise as splice variants are designated by lower case letter, such as α_(2a), . . α_(2e). In addition, the α₂ subunit and the large fragment produced under reducing conditions appear to be glycosylated with at least N-linked sugars and do not specifically bind to the 1,4-DHPs and phenylalkylamines that specifically bind to the α₁ subunit. The smaller fragment, the C-terminal fragment, is referred to as the δ subunit and includes amino acids from about 946 (SEQ ID No. 11) through about the C-terminus. This fragment may dissociate from the remaining portion of α₂ when the α₂ subunit is exposed to reducing conditions.

As used herein, a β subunit is encoded by DNA that hybridizes to the DNA provided herein under conditions of low stringency or encodes a protein that has about 40% homology with that disclosed herein and is a protein that typically has a molecular weight lower than the a subunits and on the order of about 50-80 kD, does not form a detectable calcium channel in the absence of an α₁ subunit, but may alter the activity of a calcium channel that contains an a, subunit or that contains an α₁ and α₂ subunit.

Types of the β subunit that are encoded by different genes are designated with subscripts, such as β₁ and β₃. Subtypes of β suits that arise as splice variants of a particular type are designated with a numerical subscript referring to the subtype and to the variant. Such subtypes include, but are not limited to the β₁ splice variants, including β₁₋₁ -β₁₋₅.

As used herein, a γ subunit is a subunit encoded by DNA disclosed herein as encoding the γ subunit and may be isolated and identified using the DNA disclosed herein as a probe by hybridization or other such method known to those of skill in the art, whereby full-length clones encoding a γ subunit may be isolated or constructed. A γ subunit will be encoded by DNA that hybridizes to the DNA provided herein under conditions of low stringency or exhibits sufficient sequence homology to encode a protein that has about 40% homology with the γ subunit described herein.

Thus, one of skill in the art, in light of the disclosure herein, can identify DNA encoding α₁, α₂, β, γ and calcium channel subunits, including types encoded by different genes and subtypes that represent splice variants. For example, DNA probes based on the DNA disclosed herein may be used to screen an appropriate library, including a genomic or cDNA library, and obtain DNA in one or more clones that includes an open reading fragment that encodes an entire protein. Subsequent to screening an appropriate library with the DNA disclosed herein, the isolated DNA can be examined for the presence of an open reading frame from which the sequence of the encoded protein may be deduced. Determination of the molecular weight and comparison with the sequences herein should reveal the identity of the subunit as an α₁, α₂ etc. subunit. Functional assays may, if necessary, be used to determine whether the subunit is an α₁, α₂ subunit or β subunit.

For example, DNA encoding α_(1A) may be isolated by screening an appropriate library with DNA, encoding all or a portion of the human α_(1A) subunit, isolated from the phage deposited under ATCC Accession No. 75293, including screening with an oligonucleotide having the sequence set forth in SEQ ID No. 21. Similarly, DNA encoding β₃ may be isolated by screening a human cDNA library with DNA probes prepared from the plasmid β1.42 deposited under ATCC Accession No. 69048 or probes having sequences prepared according to the sequences set forth in SEQ ID Nos. 19 and 20. Any method known to those of skill in the art for isolation and identification of DNA and preparation of full-length genomic or cDNA clones, including methods exemplified herein, may be used.

The subunit encoded by isolated DNA may be identified by comparison with the DNA and amino acid sequences of the whereas subunits provided herein. Splice variants share extensive regions of homology, but include non-homologous regions, whereas subunits encoded by different genes share a uniform distribution of non-homologous sequences.

As used herein, a splice variant refers to a variant produced by differential processing of a primary transcript of genomic DNA that results in more than one type of mRNA. Splice variants may occur within a single tissue type or among tissues (tissue-specific variants). Thus, cDNA clones that encode calcium channel subunit subtypes that have regions of identical amino acids and regions of different amino acid sequences are referred to herein as "splice variants".

As used herein, a "calcium channel selective ion" is an ion that is capable of flowing through, or being blocked from flowing through, a calcium channel which spans a cellular membrane under conditions which would substantially similarly permit or block the flow of Ca²⁺. Ba²⁺ is an example of an ion which is a calcium channel selective ion.

As used herein, a compound that modulates calcium channel activity is one that affects the ability of the calcium channel to pass calcium channel selective ions or affects other detectable calcium channel features, such as current kinetics. Such compounds include calcium channel antagonists and agonists and compounds that exert their effect on the activity of the calcium channel directly or indirectly.

As used herein, a "substantially pure" subunit or protein is a subunit or protein that is sufficiently free of other polypeptide contaminants to appear homogeneous by SDS-PAGE or to be unambiguously sequenced.

As used herein, heterologous or foreign DNA and RNA are used interchangeably and refer to DNA or RNA that does not occur naturally as part of the genome in which it is present or which is found in a location or locations in the genome that differ from that in which it occurs in nature. It is DNA or RNA that is not endogenous to the cell and has been artificially introduced into the cell. Examples of heterologous DNA include, but are not limited to, DNA that encodes a calcium channel subunit and DNA that encodes RNA or proteins that mediate or alter expression of endogenous DNA by affecting transcription, translation, or other regulatable biochemical processes. The cell that expresses the heterologous DNA, such as DNA encoding the calcium channel subunit, may contain DNA encoding the same or different calcium channel subunits. The heterologous DNA need not be expressed and may be introduced in a manner such that it is integrated into the host dell genome or is maintained episomally.

As used herein, operative linkage of heterologous DNA to regulatory and effector sequences of nucleotides, such as promoters, enhancers, transcriptional and translational stop sites, and other signal sequences, refers to the functional relationship between such DNA and such sequences of nucleotides. For example, operative linkage of heterologous DNA to a promoter refers to the physical and functional relationship between the DNA and the promoter such that the transcription of such DNA is initiated from the promoter by an RNA polymerase that specifically recognizes, binds to and transcribes the DNA in reading frame.

As used herein, isolated, substantially pure DNA refers to DNA fragments purified according to standard techniques employed by those skilled in the art see, e.g., Maniatis et al. (1982) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.!.

As used herein, expression refers to the process by which nucleic acid is transcribed into mRNA and translated into peptides, polypeptides, or proteins. If the nucleic acid is derived from genomic DNA, expression may, if an appropriate eukaryotic host cell or organism is selected, include splicing of the mRNA.

As used herein, vector or plasmid refers to discrete elements that are used to introduce heterologous DNA into cells for either expression of the heterologous DNA or for replication of the cloned heterologous DNA. Selection and use of such vectors and plasmids are well within the level of skill of the art.

As used herein, expression vector includes vectors capable of expressing DNA fragments that are in operative linkage with regulatory sequences, such as promoter regions, that are capable of effecting expression of such DNA fragments. Thus, an expression vector refers to a recombinant DNA or RNA construct, such as a plasmid, a phage, recombinant virus or other vector that, upon introduction into an appropriate host cell, results in expression of the cloned DNA. Appropriate expression vectors are well known to those of skill in the art and include those that are replicable in eukaryotic cells and/or prokaryotic cells and those that remain episomal or may integrate into the host cell genome.

As used herein, a promoter region refers to the portion of DNA of a gene that controls transcription of DNA to which it is operatively linked. The promoter region includes specific sequences of DNA that are sufficient for RNA polymerase recognition, binding and transcription initiation. This portion of the promoter region is referred to as the promoter. In addition, the promoter region includes sequences that modulate this recognition, binding and transcription initiation activity of the RNA polymerase. These sequences may be cis acting or may be responsive to trans acting factors. Promoters, depending upon the nature of the regulation, may be constitutive or regulated.

As used herein, a recombinant eukaryotic cell is a eukaryotic cell that contains heterologous DNA or RNA.

As used herein, a recombinant or heterologous calcium channel refers to a calcium channel that contains one or more subunits that are encoded by heterologous DNA that has been introduced into and expressed in a eukaryotic cells that expresses the recombinant calcium channel. A recombinant calcium channel may also include subunits that are produced by DNA endogenous to the cell. In certain embodiments, the recombinant or heterologous calcium channel may contain only subunits that are encoded by heterologous DNA.

As used herein, "functional" with respect to a recombinant or heterologous calcium channel means that the channel is able to provide for and regulate entry of calcium channel selective ions, including, but not limited to, Ca²⁺ or Ba²⁺, in response to a stimulus and/or bind ligands with affinity for the channel. Preferably such calcium channel activity is distinguishable, such as electrophysiological, pharmacological and other means known to those of skill in the art, from any endogenous calcium channel activity that in the host cell.

As used herein, a peptide having an amino acid sequence substantially as set forth in a particular SEQ ID No. includes peptides that have the same function but may include minor variations in sequence, such as conservative amino acid changes or minor deletions or insertions that do not alter the activity of the peptide. The activity of a calcium channel receptor subunit peptide refers to its ability to form functional calcium channels with other such subunits.

As used herein, a physiological concentration of a compound is that which is necessary and sufficient for a biological process to occur. For example, a physiological concentration of a calcium channel selective ion is a concentration of the calcium channel selective ion necessary and sufficient to provide an inward current when the channels open.

As used herein, activity of a calcium channel refers to the movement of a calcium selective ion through a calcium channel. Such activity may be measured by any method known to those of skill in the art, including, but not limited to, measurement of the amount of current which flows through the recombinant channel in response to a stimulus.

As used herein, a "functional assay" refers to an assay that identifies functional calcium channels. A functional assay, thus, is an assay to assess function.

As understood by those skilled in the art, assay methods for identifying compounds, such as antagonists and agonists, that modulate calcium channel activity, generally requires comparison to a control. One type of a "control" cell or "control" culture is a cell or culture that is treated substantially the same as the cell or culture exposed to the test compound except that the control culture is not exposed to the test compound. Another type of a "control" cell or "control" culture may be a cell or a culture of cells which are identical to the transfected cells except the cells employed for the control culture do not express functional calcium channels. In this situation, the response of test cell to the test compound compared to the response (or lack of response) of the receptor-negative cell to the test compound, when cells or cultures of each type of cell are exposed to substantially the same reaction conditions in the presence of the compound being assayed. For example, in methods that use patch clamp electrophysiological procedures, the same cell can be tested in the presence and absence of the test compound, by changing the external solution bathing the cell as known in the art.

Assays

Assays for identifying compounds that modulate calcium channel activity

In vitro methods for identifying compounds, such as calcium channel agonist and antagonists, that modulate the activity of calcium channels using eukaryotic cells that express heterologous human calcium channels are provided.

In particular, the assays use eukaryotic cells that express heterologous human calcium channel subunits encoded by heterologous DNA provided herein, for screening potential calcium channel agonists and antagonists which are specific for human calcium channels and particularly for screening for compounds that are specific for particular human calcium channel subtypes. Such assays may be used in conjunction with methods of rational drug design to select among agonists and antagonists, which differ slightly in structure, those particularly useful for modulating the activity of human calcium channels, and to design or select compounds that exhibit subtype- or tissue-specific calcium channel antagonist and agonist activities.

These assays should accurately predict the relative therapeutic efficacy of a compound for the treatment of certain disorders in humans. In addition, since subtype- and tissue-specific calcium channel subunits are provided, cells with tissue- specific or subtype-specific recombinant calcium channels may be prepared and used in assays for identification of human calcium channel tissue- or subtype-specific drugs.

The assays involve contacting the cell membrane of a recombinant eukaryotic cell which expresses at least one subunit of a human calcium channel, preferably at least an α₁ subunit of a human calcium channel, with a test compound and measuring the ability of the test compound to specifically bind to the membrane or alter or modulate the activity of a heterologous calcium channel on the membrane.

In preferred embodiments, the assay uses a recombinant cell which has a calcium channel containing an α₁ subunit of a human calcium channel in combination with a β-subunit of a human calcium channel and/or an α₂ subunit of a human calcium channel. Recombinant cells expressing heterologous calcium channels containing each of the α₁, β and α₂ human subunits, and, optionally, a γ subunit of a human calcium channel are especially preferred for use in such assays.

In certain embodiments, the assays for identifying compounds that modulate calcium channel activity are practiced by measuring the calcium channel activity of a eukaryotic cell having a heterologous, functional calcium channel when such cell is exposed to a solution containing the test compound and a calcium channel selective ion and comparing the measured calcium channel activity to the calcium channel activity of the same cell or a substantially identical control cell in a solution not containing the test compound. The cell is maintained in a solution having a concentration of calcium channel selective ions sufficient to provide an inward current when the channels open. Especially preferred for use, is a recombinant cell expressing calcium channels that include each of the α₁, β and α₂ human subunits, and, optionally, a γ subunit of a human calcium channel. Methods for practicing such assays are known to those of skill in the art. For example, for similar methods applied with Xenopus laevis oocytes and acetylcholine receptors, see, Mishina et al. (1985) Nature 313:364! and, with such ooocytes and sodium channels see, Noda et al. (1986) Nature 322:826-828!; For similar studies which have been carried out with the acetylcholine receptor, see, e.g., Claudio et al. (1987) Science 238:1688-1694!.

The assays thus use cells, provided herein, that express heterologous functional calcium channels and measure functionally, such as electrophysiologically, the ability of a test compound to potentiate, antagonize or otherwise modulate the magnitude and duration of the flow of calcium channel selective ions, such as Ca⁺⁺ or Ba⁺⁺, through the heterologous functional channel. The amount of current which flows through the recombinant calcium channels of a cell may be determined directly, such as electrophysiologically, or by monitoring an independent reaction which occurs intracellularly and which is directly influenced in a calcium (or other) ion dependent manner.

Any method for assessing the activity of a calcium channel may be used in conjunction with the cells and assays provided herein. For example, in one embodiment of the method for testing a compound for its ability to modulate calcium channel activity, the amount of current is measured by its modulation of a reaction which is sensitive to calcium channel selective ions and uses a eukaryotic cell which expresses a heterologous calcium channel and also contains a transcriptional control element operatively linked for expression to a structural gene that encodes an indicator protein. The transcriptional control element used for transcription of the indicator gene is responsive in the cell to a calcium channel selective on, such as Ca²⁺ and Ba⁺. The details of such transcriptional based assays are described in commonly owned PCT International Patent Application No. PCT/US91/5625, filed Aug, 7, 1991, which claims priority to copending commonly owned U.S. application Ser. No. 07/ 563,751, filed Aug. 7, 1990, the contents of which applications are herein incorporated by reference thereto.

Assays for diagnosis of LFS

LES is an autoimmune disease characterized by an insufficient release of acetylcholine from motor nerve terminals which normally are responsive to nerve impulses. Immunoglobulins (IgG) from LES patients block individual voltage-dependent calcium channels and thus inhibit calcium channel activity Kim and Neher, Sciences 239:405-408 (1988)!. A diagnostic assay for Lambert Eaton Syndrome (LES) is provided herein. The diagnostic assay for LES relies on the immunological reactivity of LES IgG with the human calcium channels or particular subunits alone or in combination or expressed on the surface of recombinant cells. For example, such an assay may be based on immunoprecipitation of LES IgG by the human calcium channel subunits and cells that express such subunits provided herein.

Identification and isolation of DNA encoding human calcium channel subunits

Methods for identifying and isolating DNA encoding α₁, α₂, β and γ subunits of human calcium channels are provided.

Identification and isolation of such DNA may be accomplished by hybridizing, under appropriate conditions, at least low stringency whereby DNA that encodes the desired subunit is isolated, restriction enzyme-digested human DNA with a labeled probe having at least 14 nucleotides and derived from any contiguous portion of DNA having a sequence of nucleotides set forth herein by sequence identification number. Once a hybridizing fragment is identified in the hybridization reaction, it can be cloned employing standard cloning techniques known to those of skill in the art. Full-length clones may be identified by the presence of a complete open reading frame and the identity of the encoded protein verified by sequence comparison with the subunits provided herein and by functional assays to assess calcium channel forming ability or other function. This method can be used to identify genomic DNA encoding the subunit or cDNA encoding splice variants of human calcium channel subunits generated by alternative splicing of the primary transcript of genomic subunit DNA. For instance, DNA, cDNA or genomic DNA, encoding a calcium channel subunit may be identified by hybridization to a DNA probe and characterized by methods known to those of skill in the art, such as restriction mapping and DNA sequencing, and compared to the DNA provided herein in order to identify heterogeneity or divergence in the sequences the DNA. Such sequence differences may indicate that the transcripts from which the cDNA was produced result from alternative splicing of a primary transcript, if the non-homologous and homologous regions are clustered, or from a different gene if the non-homologous regions are distributed throughout the cloned DNA.

Any suitable method for isolating genes using the DNA provided herein may be used. For example, oligonucleotides corresponding to regions of sequence differences have been used to isolate, by hybridization, DNA encoding the full-length splice variant and can be used to isolate genomic clones. A probe, based on a nucleotide sequence disclosed herein, which encodes at least a portion of a subunit of a human calcium channel, such as a tissue-specific exon, may be used as a probe to clone related DNA, to clone a full-length cDNA clone or genomic clone encoding the human calcium channel subunit.

Labeled, including, but not limited to, radioactively or enzymatically labeled, RNA or single-stranded DNA of at least 14 substantially contiguous bases, preferably at least 30 contiguous bases of a nucleic acid which encodes at least a portion of a human calcium channel subunit, the sequence of which nucleic acid corresponds to a segment of a nucleic acid sequence disclosed herein by reference to a SEQ ID No. are provided. Such nucleic acid segments may be used as probes in the methods provided herein for cloning DNA encoding calcium channel subunits. See, generally, Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory Press.

In addition, nucleic acid amplification techniques, which are well known in the art, can be used to locate splice variants of calcium channel subunits by employing oligonucleotides based on DNA sequences surrounding the divergent sequence primers for amplifying human RNA or genomic DNA. Size and sequence determinations of the amplification products can reveal splice variants. Furthermore, isolation of human genomic DNA sequences by hybridization an yield DNA containing multiple exons, separated by introns, that correspond to different splice variants of transcripts encoding human calcium channel subunits.

DNA encoding types and subtypes of each of the α₁, α₂, β and γ subunit of voltage-dependent human calcium channels have been cloned herein by screening human cDNA libraries prepared from isolated poly A+ mRNA from cell lines or tissue of human origin having such calcium channels. Among the sources of such cells or tissue for obtaining ERA are human brain tissue or a human cell line of neural origin, such as a neuroblastoma cell line, human skeletal muscle or smooth muscle cells, and the like. Methods of preparing cDNA libraries are well known in the art see generally Ausubel et al. (1987) Current Protocols in Molecular Biology, Wiley-Interscience, New York; and Davis et al. (1986) Basic Methods in Molecular Biology, Elsevier Science Publishing Co., New York!.

With respect to each of the respective subunits of a human calcium channel (α₁, α₂, β or γ), once the DNA encoding the channel subunit was identified by a nucleic acid screening method, the isolated clone was used for further screening to identify overlapping clones. Some of the cloned DNA fragments can and have been subcloned into an appropriate vector such as pIBI24/25 (IBI, New Haven, Conn.), M13mp18/19, pGEM4, pGEM3, pGEM7Z, pSP72 and other such vectors known to those of skill in this art, and characterized by DNA sequencing and restriction enzyme mapping. A sequential series of overlapping clones may thus be generated for each of the subunits until a full-length clone can be prepared by methods, known to those of skill in the art, that include identification of translation initiation (start), and translation termination (stop) codons. For expression of the cloned DNA, the 5' noncoding region and other transcriptional and translational control regions of such a clone may be replaced with an efficient ribosome binding site and other regulatory regions as known in the art. Examples II-VI, below, describe in detail the cloning of each of the various subunits of a human calcium channel as well as subtypes and splice variants, including tissue-specific variants thereof. In the instances in which partial sequences of a subunit are disclosed, it is well within the skill of the art, in view of the teaching herein, to obtain the corresponding full-length nucleotide sequence encoding the subunit, subtype or splice variant thereof.

Identification and isolation of DNA encoding α₁ subunits

A number of voltage-dependent calcium channel α₁ subunit genes, which are expressed in the human CNS, have been identified and have been designated as α_(1A), α_(1B) (or VDCC IV), α_(1C) (or VDCC II) and α_(ID) (or VDCC III). DNA, isolated from a human neuronal cDNA library, that encodes each of the subunit types has been isolated. DNA encoding subtypes of each of the types, which arise as splice variants are also provided. Subtypes are herein designated, for example, as α_(1B-1), α_(1B-2).

The α₁ subunits types A B, C, and D of voltage-dependent calcium channels, and subtypes thereof, differ with respect to sensitivity to known classes of calcium channel agonists and antagonists, such as DHPs, phenylalkylamines, omega conotoxin (ω-CgTx) and pyrazonoylguanidines. They also appear to differ in the holding potential and ion the kinetics of currents produced upon depolarization of cell membranes containing calcium channels that include different types of α₁ subunits.

DNA that encodes an α₁ -subunit that binds to at least one compound selected from among dihydropyridines, phenylalkylamines, ω-CgTx, components of funnel web spider toxin, and pyrazonoylguanidines is provided. For example, the α_(1B) subunit provided herein appears to specifically interact with ω-CgTx in N-type channels, and the α_(1D) subunit provided herein specifically interacts with DHPs in L-type channels.

Identification and isolation of DNA encoding the α_(ID) human calcium channel subunit

The α_(ID) subunit cDNA has been isolated using fragments of the rabbit skeletal muscle calcium channel α₁ subunit cDNA as a probe to screen a cDNA library of a human neuroblastoma cell line, IXR32, to obtain clone α1.36. This clone was used as a probe to screen additional IMR32 cell cDNA libraries to obtain overlapping clones, which were then employed for screening until a sufficient series of clones to span the length of the nucleotide sequence encoding the human. α_(ID) subunit were obtained. Full-length clones encoding α_(ID) were constructed by ligating portions of partial α_(ID) clones as described in Example II. SEQ ID No. 1 shows the 7,635 nucleotide sequence of the cDNA encoding the α_(ID) subunit. There is a 6,483 nucleotide sequence reading frame which encodes a sequence of 2,161 amino acids (as set forth in SEQ ID No. 1).

SEQ ID No. 2 provides the sequence of an alternative exon encoding the IS6 transmembrane domain see Tanabe, T., et al. (1987) Nature 328:313-318 for a description of transmembrane domain terminology) of the α_(ID) subunit.

SEQ ID No. 1 also shows the 2,161 amino acid sequence deduced from the human neuronal calcium channel α_(ID) subunit DNA. Based on the amino acid sequence, the α_(ID) protein has a calculated Mr of 245,163. The α_(ID) subunit of the calcium channel contains four putative internal repeated sequence regions. Four internally repeated regions represent 24 putative transmembrane segments, and the amino- and carboxyl-termini extend intracellularly.

The α_(1D) subunit has been shown to madiate DHP-sensitive, high-voltage-activated, long-lasting calcium channel activity. This calcium channel activity was detected when oocytes were co-injected with. RNA transcripts encoding an α_(1D) and β₁ or α_(1D), α₂ and β₁ subunits. This activity was distinguished from Ba²⁺ currents detected when oocytes were injected with RNA transcripts encoding the β₁ ±α₂ subunits. These currents pharmacologically and biophysically resembled Ca²⁺ currents reported for uninjected oocytes.

Identification and isolation DNA encoding the α_(1A) human calcium channel subunit

Biological material containing DNA encoding the α_(1A) subunit had been deposited in the American Type Culture Collection, 12301 Parklawn Drive, Rockville, Md. 20852 U.S.A. under the terms of the Budapest treaty on the International Recognition of Deposits of Microorganisms for Purposes of Patent Procedure and the Regulations promulgated under this Treaty. Samples of the deposited material are and will be available to industrial property offices and other persons legally entitled to receive them under the terms of said Treaty and Regulations and otherwise in compliance with the patent laws and regulations of the United States of America and all other nations or international organizations in which this application, or an application claiming priority of this application, is filed or in which any patent granted on any such application is granted.

The α₁, subunit is encoded by an approximately 3 kb insert in λgt10 phage designated α1.254 in E. coli host strain NM514. A phage lysate of this material has been deposited as at the American Type Culture Collection under ATCC Accession No. 75293, as described above. DNA encoding α_(1A) may also be identified by screening with a probe prepared from DNA that has SEQ ID No. 21:

    5' CTCAGTACCATCTCTGATACCAGCCCCA 3'.

Identification and isolation of DNA encoding the α_(1B) human calcium channel subunit

DNA encoding the α_(1B) subunit was isolated by screening a human basal ganglia cDNA library with fragments of the rabbit skeletal muscle calcium channel α₁ subunit-encoding cDNA. A portion of one of the positive clones was used to screen an IMR32 cell cDNA library. Clones that hybridized to the basal ganglia DNA probe were used to further screen an IMR32 cell cDNA library to identify overlapping clones that in turn were used to screen a human hippocampus cDNA library. In this way, a sufficient series of clones to span nearly the entire length of the nucleotide sequence encoding the human α_(1B) subunit was obtained. PCR amplification of specific regions of the IMR32 cell α_(1B) mRNA yielded additional segments of the α_(1B) coding sequence.

A full-length α_(1B) DNA clone was constructed by ligating portions of the partial cDNA clones as described in Example II.C. SEQ ID Nos. 7 and 8 show the nucleotide sequences of DNA clones encoding the α_(1B) subunit as well as the deduced amino acid sequences. The α_(1B) subunit encoded by SEQ ID No. 7 is referred to as the α_(1B-1) subunit to distinguish it from another α_(1B) subunit α_(1b-2), encoded by the nucleotide sequence shown as SEQ ID No. 8, which is derived from alternative splicing of the α_(1B) subunit transcript.

PCR amplification of IMR32 cell mRNA using oligonucleotide primers designed according to nucleotide sequences within the α_(1B-1) -encoding DNA has identified variants of the α_(1B) transcript that appear to be splice variants because they contain divergent coding sequences.

Identification and isolation of DNA encoding the α_(1C) human calcium channel subunit

Numerous α_(1C) -specific DNA clones were isolated. Characterization of the sequence revealed the α_(1C) coding sequence, the α_(1C) initiation of translation sequence, and an alternatively spliced region of α_(1C). Alternatively spliced variants of the α_(1C) subunit have been identified. SEQ ID No. 3 sets forth DNA encoding an α_(1C) subunit. The DNA sequences set forth in SEQ ID No. 4 and No. 5 encode two possible amino terminal ends of the α_(1C) protein. SEQ ID No. 6 encodes an alternative exon for the IV S3 transmembrane domain.

The isolation and identification of DNA clones encoding portions of the α_(1C) subunit is described in detail in Example II.

DNA encoding other α₁ subunits, including α_(1A), has also been isolated. Additional such subunits may also be, isolated and identified using the DNA provided herein as described for the α_(1B), α_(1C) and α_(1D) subunits or using other methods known to those of skill in the art.

Identification and isolation DNA encoding β human calcium channel subunits DNA encoding β₁

To isolate DNA encoding the β₁ subunit, a human hippocampus cDNA library was screened by hybridization to a DNA fragment encoding a rabbit skeletal muscle calcium channel β subunit. A hybridizing clone was selected and was in turn used to isolate overlapping clones until the overlapping clones encompassing DNA encoding the entire the human calcium channel β subunit were isolated and sequenced.

Five alternatively spliced forms of the human calcium channel β₁ subunit have been identified and DNA encoding a number of forms have been isolated. These forms are designated β₁₋₁, expressed in skeletal muscle, β₁₋₂, expressed in the CNS, β₁₋₃, also expressed in the in the CNS, β₁₋₄, expressed in aorta tissue and HEK 293 cells, and β₁₋₅, expressed in HEK 293 cells. A full-length DNA clone encoding the β1-2 subunit has been constructed. The subunits β₁₋₁, β₁₋₂, β₁₋₄ and β₁₋₅ have been identified by PCR analysis as alternatively spliced forms of the β subunit.

The alternatively spliced variants were identified by comparison of amino acid sequences encoded by the human neuronal and rabbit skeletal muscle calcium channel β subunit-encoding DNA. This comparison revealed a 45-amino acid deletion in the human β subunit compared to the rabbit β subunit. Using DNA from the region as a probe for DNA cloning, as well as PCR analysis and DNA sequencing of this area of sequence divergence, alternatively spliced forms of the human calcium channel β subunit transcript were identified. For example, the sequence of DNA encoding one splice variant β₁₂ is set forth in SEQ ID No. 9. SEQ ID No. 10 sets forth the sequence of the β₁₋₃ subunit (nt 1-1851, including 3' untranslated sequence nt 1795-1851), which is another splice variant of the β subunit primary transcript. β₁₋₂ and β₁₋₃ are human neuronal β subunits. DNA distinctive for a portion of a β subunit (β₁₋₄) of a human aortic calcium channel and also human embryonic kidney (HEK) cells is set forth in SEQ ID No. 12 (nt 1-13 and 191-271). The sequence of DNA encoding a portion of a human calcium channel β subunit expressed in skeletal muscle (β₁₋₁) is shown in SEQ ID No. 12 (nt 1-13 and 35-271).

DNA encoding β₃

DNA encoding the β₃ subunit and any splice variants thereof may be isolated by screening a library, as described above for the β₁ subunit, using DNA probes prepared according to SEQ ID Nos. 19 and 20 or using all or a portion of the deposited 3 clone plasmid β1.42 (ATCC Accession No. 69048).

The E. coli host containing plasmid β1.42 that includes DNA encoding the β₃ subunit has been deposited as ATCC Accession No. 69048 in the American Type Culture Collection, 12301 Parklawn Drive, Rockville, Md. 20852 U.S.A. under the terms of the Budapest Treaty on the International Recognition of Deposits of Microorganisms for Purposes of Patent Procedure and the Regulations promulgated under this Treaty. Samples of the deposited material are and will be available to industrial property offices and other persons legally entitled to receive them under the terms of said Treaty and Regulations and otherwise in compliance with the patent laws and regulations of the United States of America and all other nations or international organizations in which this application, or an application claiming priority of this application, is filed or in which any patent granted on any such application is granted.

The β₃ encoding plasmid is designated β1.42. The plasmid contains a 2.5 kb EcoRI fragment encoding β₃ inserted into vector pGem7zF(+) and has been deposited in E. coli host strain DN5α. A partial DNA sequence of the 5' and 3' ends of β₃ are set forth in SEQ ID Nos. 19 and 20, respectively.

Identification and isolation DNA encoding the α₂ human calcium channel subunit

DNA encoding a human neuronal calcium channel α₂ subunit was isolated in a manner substantially similar to that used for isolating DNA encoding an α₁ subunit, except that a human genomic DNA library was probed under low and high stringency conditions with a fragment of DNA encoding the rabbit skeletal muscle calcium channel α₂ subunit. The fragment included nucleotides having, a sequence corresponding to the nucleotide sequence between nucleotides 43 and 272 inclusive of rabbit back skeletal muscle calcium channel α₂ subunit cDNA as disclosed in PCT International Patent Application Publication No. WO 89/09834, which corresponds to U.S. application Ser. No. 07/620,520, which is a continuation-in-parts of U.S. Ser. No. 176,899, filed Apr. 4, 1988, which applications have been incorporated herein by reference.

Example IV describes the isolation of DNA clones encoding α₂ subunits of a human calcium channel from a human DNA library using genomic DNA and cDNA clones, identified by hybridization to the genomic DNA, as probes.

SEQ ID No. 11 shows the sequence of DNA encoding an 2 subunit. As described in Example V, PCR analysis of RNA from human skeletal muscle, brain tissue and aorta using oligonucleotide primers specific for a region of the human neuronal α₂ subunit cDNA that diverges from the rabbit skeletal muscle calcium channel β2 subunit cDNA identified splice variants of the human calcium channel α₂ subunit transcript.

Identification and isolation of DNA encoding γ human calcium channel subunits

DNA encoding a human neuronal calcium channel γ subunit has been isolated as described in detail in Example VI. SEQ ID No. 14 shows the nucleotide sequence at the 3'-end of this DNA which includes a reading frame encoding a sequence of 43 amino acid residues.

Preparation of recombinant eukaryotic cells containing DNA encoding heterologous calcium channel subunits

DNA encoding one or more of the calcium channel subunits or a portion of a calcium channel subunit may be introduced into a host cell for expression or replication of the DNA. Such DNA may be introduced using methods described in the following examples or using other procedures well known to those skilled in the art. Incorporation of cloned DNA into a suitable expression vector, transfection of eukaryotic cells with a plasmid vector or a combination of plasmid vectors, each encoding one, or more distinct genes or with linear DNA, and selection of transfected cells are also well known in the art see, e.g., Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor Laboratory Press!.

Cloned full-length DNA encoding any of the subunits of a human calcium channel may be introduced into a plasmid vector for expression in a eukaryotic cell. Such DNA may be genomic DNA or cDNA. Host cells may be transfected with one or a combination of said plasmids, each of which encodes at least one calcium channel subunit. Alternatively, host cells may be transfected with linear DNA using methods well known to those of skill in the art.

While the DNA provided herein may be expressed in any eukaryotic cell, including yeast cells such as P. pastoris see, e.g., Cregg et al. (1987) Bio/Technology 5:479!, mammalian expression systems for expression of the DNA encoding the human calcium channel subunits provided herein are preferred.

The heterologous DNA may be introduced by maintaining method known to those of skill in the art, such as transfection with a vector encoding the heterologous DNA. Particularly preferred vectors for transfection of mammalian cells are the pSV2dhfr expression vectors, which contain the SV40 early promoter, mouse dhfr gene, SV40 polyadenylation and splice sites and sequences necessary for maintaining the vector in bacteria, cytomegalovirus (CMV) promoter-based vectors such as pCMV or pCDNA1, and MMTV promoter-based vectors. DNA encoding the human calcium channel subunits has been inserted in the vector pCDNA1 at a position immediately following the CMV promoter.

Stably or transiently transfected mammalian cells may be prepared by methods known in the art by transfecting cells with an expression vector having a selectable marker gene such as the gene for thymidine kinase, dihydrofolate reductase, neomycin resistance or the like, and, for transient transfection, growing the transfected cells under conditions selective for cells expressing the marker gene. Functional voltage-dependent calcium channels have been produced in HEK 293 cells transfected with a derivative of the vector pCDNA1 that contains DNA encoding a human calcium channel subunit.

The heterologous DNA may be maintained in the cell as an episomal element or may be integrated into chromosomal DNA of the cell. The resulting recombinant cells may then be cultured or subcultured (or passaged, in the case of mammalian cells) from such a culture or a subculture thereof. Methods for transfection, injection and culturing recombinant cells are known to the skilled artisan. Eukaryotic cells in which DNA or RNA may be introduced, include any cells that are transfectable by such DNA or RNA or into which such DNA may be injected. Virtually any eukaryotic cell can serve as a vehicle for heterologous DNA. Preferred cells are those that can also express the DNA and RNA and most preferred cells are those that can form recombinant or heterologous calcium channels that include one or more subunits encoded by the heterologous DNA. Such cells may be identified empirically or selected from among those known to be readily transfected or injected. Preferred cells for introducing DNA include those that can be transiently or stably transfected and include, but are not limited to cells of mammalian origin, such as CCS cells, mouse L cells, CHO cells, human embryonic kidney cells, African green morkey cells and other such cells known to those of skill in the art, amphibian cells, such as Xenopus laevis oocytes, or those of yeast such as Saccharomyces cerevisiae or Pichia pastoris. Preferred cells for expressing injected RNA transcripts include Xenopus laevis oocytes. Cells that are preferred for transfection of DNA are those that can be readily and efficiently transfected. Such cells are known to those of skill in the art or may be empirically identified. Preferred cells include DG44 cells and HEK 293 cells, particularly HEK 293 cells that have been adapted for growth in suspension and that can be frozen in liquid nitrogen and then thawed and regrown. Such HEK 293 cells are described, for example in U.S. Pat. No. 5,024,939 to Gorman see, also Stillman et al. (1985) Mol. Cell.Biol. 5:2051-2060!.

The cells may be used as vehicles for replicating heterologous DNA introduced therein or for expressing the heterologous DNA introduced therein. In certain embodiments, the cells are used as vehicles for expressing the heterologous DNA as a means to produce substantially pure human calcium channel subunits or heterologous calcium channels. Host cells containing the heterologous DNA may be cultured under conditions whereby the calcium channels are expressed. The calcium channel subunits may be purified using protein purification methods known to those of skill in the art. For example, antibodies, such as those provided herein, that specifically bind to one or more of the subunits may be used for affinity purification of the subunit or calcium channels containing the subunits.

Substantially pure subunits of a human calcium channel α₁ subunits of a human calcium channel, α₂ subunits of a human calcium channel, β subunits of a human calcium channel and γ subunits of a human calcium channel are provided. Substantially pure isolated calcium channels that contain at least one of the human calcium channel subunits are also provided. Substantially pure calcium channels that contain a mixture of one or more subunits encoded by the host cell and one or more subunits encoded by heterologous DNA or RNA that has been introduced into the cell are also provided. Substantially pure subtype- or tissue-type specific calcium channels are also provided.

In other embodiments, eukaryotic cells that contain heterologous DNA encoding at least one of an α₁ subunit of a human calcium channel , an α₂ subunit of a human calcium channel, a β subunit of a human calcium channel and a γ subunit of a human calcium channel are provided. In accordance with one preferred embodiment, the heterologous DNA is expressed in the eukaryotic cell and preferably encodes a human calcium channel α₁ subunit.

In particularly preferred aspects, the eukaryotic cell which contains the heterologous DNA expresses it and forms a recombinant functional calcium channel activity. In more preferred aspects, the recombinant calcium channel activity is readily detectable because it is a type that is absent from the untransfected host cell or is of a magnitude not exhibited in the untransfected cell.

Preferred among such cells is a recombinant eukaryotic cell with a functional heterologous calcium channel. The recombinant cell can be produced by introduction of and expression of heterologous DNA or RNA transcripts encoding an α₁ subunit of a human calcium channel, more preferably also expressing, a heterologous DNA encoding a β subunit of a human calcium channel and/or heterologous DNA encoding an β₂ subunit of a human calcium channel. Especially preferred is the expression in such a recombinant cell of each of the α₁, β and α₁ subunits encoded by such heterologous DNA or RNA transcripts, and optionally expression of heterologous DNA or an RNA transcript encoding a γ subunit of a human calcium channel.

In certain embodiments, the eukaryotic cell with a heterologous calcium channel is produced by introducing into the cell a first composition, which contains at least one RNA transcript that is translated in the cell into a subunit of a human calcium channel. In preferred embodiments, the subunits that are translated include an α₁ subunit of a human calcium channel. More preferably, the composition that is introduced contains an RNA transcript which encodes an α₁ subunit of a human calcium channel and also contains (1) an RNA transcript which encodes a β subunit of a human calcium channel and/or (2) an RNA transcript which encodes an α₂ subunit of a human calcium channel. Especially preferred is the introduction of RNA encoding an α₁, a β and an α₂ human calcium channel subunit, and, optionally, a γ subunit of a human calcium channel.

Methods for in vitro transcription of a cloned DNA and injection of the resulting RNA into eukaryotic cells are well known in the art. Transcripts of any of the full-length DNA encoding any of the subunits of a human calcium channel may be injected alone or in combination with other transcripts into eukaryotic cells for expression in the cells. Amphibian oocytes are particularly preferred for expression of in vitro transcripts of the human calcium channel subunit cDNA clones provided herein.

The functional calcium channels may preferably include at least an α₁ subunit and a β subunit of a human calcium channel. Eukaryotic cells expressing these two subunits and also cells expressing additional subunits, have been prepared by transfection of DNA and by injection of RNA transcripts. Such cells have exhibited voltage-dependent calcium channel activity attributable to calcium channels that contain one or more of the heterologous human calcium channel subunits. For example, eukaryotic cells expressing heterologous calcium channels containing an α₂ subunit in addition to the α₁ subunit and a β subunit have been shown to exhibit increased calcium selective ion flow across the cellular membrane in response to depolarization, indicating that the α₂ subunit may potentiate calcium channel function.

Eukaryotic cells which express heterologous calcium channels containing at least a human α₁ subunit, a human β subunit and a human β₂ subunit are preferred. Eukaryotic cells transformed with a composition containing cDNA or an RNA transcript that encodes an α₁ subunit alone or in combination with a β and/or an α₂ subunit may be used to produce cells that express functional calcium channels. Since recombinant cells expressing human calcium channels containing all of the of the human subunits encoded by the heterologous cDNA or RNA are especially preferred, it is desirable to inject or transfect such host cells with a sufficient concentration of the subunit-encoding nucleic acids to form calcium channels that contain the human subunits encoded by heterologous DNA or RNA. The precise amounts and ratios of DNA or RNA encoding the subunits may be empirically determined and optimized for a particular combination of subunits, cells and assay conditions.

With respect to measurement of the activity of functional heterologous calcium channels, preferably, endogenous ion channel activity and, if desired, heterologous channel activity of channels that do not contain the desired subunits, of a host cell can be inhibited to a significant extent by chemical, pharmacological and electrophysiological means, including the use of differential holding potential, to increase the S/N ratio of the measured heterologous calcium channel activity.

Among the uses for eukaryotic cells which recombinantly express one or more subunits are assays for determining whether a test compound has calcium channel agonist or antagonist activity. Desirably, a host cell for the expression of calcium channel subunits does not produce endogenous calcium channel subunits of the type or in an amount that substantially interferes with the detection of heterologous calcium channel subunits in ligand binding assays or detection of heterologous calcium channel function, such as generation of calcium current, in functional assays.

With respect to ligand binding assays, the host cells preferably should not produce endogenous calcium channels which detectably interact with compounds having, at physiological concentrations (generally nanomolar or picomolar concentrations), affinity for calcium channels that contain one or all of the human calcium channel subunits provided herein.

With respect to ligand binding assays for identifying a compound which has affinity for calcium channels, cells are employed which express, preferably, at least a heterologous α₁ subunit. Transfected eukaryotic cells which express at least an α₁ subunit may be used to determine the ability of a test compound to specifically alter the activity of a calcium channel. Such ligand binding assays may be performed on intact transfected cells or membranes prepared therefrom.

The capacity of a test compound to bind to or otherwise interact with membranes that contain heterologous calcium channels or subunits thereof may be determined by using any appropriate method, such as competitive binding analysis, such as Scatchard plots, in which the binding capacity of such membranes is determined in the presence and absence of one or more concentrations of a compound having known affinity for the calcium channel. Where necessary, the results may be compared to a control experiment designed in accordance with methods known to those of skill in the art. For example, as a negative control, the results may be compared to those of assays of an identically treated membrane preparation from host cells which have not been transfected with one or more subunit-encoding nucleic acids.

Stably or transiently transfected cells or injected cells which express voltage-dependent human calcium channels containing one or more of the subunits of a human calcium channel desirably may be used in assays to identify agents, such as calcium channel agonists and antagonists, that modulate calcium channel activity. Functionally testing the activity of test compounds, including compounds having unknown activity, for calcium channel agonist or antagonist activity to determine if the test compound potentiates, inhibits or otherwise alters the flow of calcium through a human calcium channel can be accomplished by (a) maintaining a eukaryotic cell which is transfected or injected to express a heterologous functional calcium channel capable of regulating the flow of calcium-channel selective ions into the cell in a medium containing calcium channel selective ions (i) in the presence of and (ii) in the absence of a test compound; (b) maintaining the cell under conditions such that the heterologous calcium channels are substantially closed and endogenous calcium channels of the cell are substantially inhibited (c) depolarizing the membrane of the cell maintained in step (b) to an extent and for an amount of time sufficient to cause (preferably, substantially only) the heterologous calcium channels to become permeable to the calcium channel selective ions; and (d) comparing the amount and duration of current flow into the cell in the presence of the test compound to that of the current flow into the cell, or a substantially similar cell, in the absence of the test compound.

Functional recombinant or heterologous calcium channels may be identified by any method known to those of skill in the art. For example, electrophysiological procedures for measuring the currently across an ion-selective membrane of a cell, which are well known, may be used. The amount and duration of the flow of calcium selective ions through heterologous calcium channels of a recombinant cell containing DNA encoding one or more of the subunits provided herein has been measured using electrophysiological recordings using a two electrode and the whole-cell patch clamp techniques. In order to improve the sensitivity of the assays, known methods can be used to eliminate or reduce non-calcium currents and calcium currents resulting from endogenous calcium channels, when measuring calcium currents through recombinant channels. For example, the DHP Bay K 8644 specifically enhances L-type calcium channel function by increasing the duration of the open state of the channels see, e.g., Hess, J. B., et al. (1984) Nature 311:538-544!. Prolonged opening of the channels results in calcium currents of increased magnitude and duration. Tail currents can be observed upon repolarization of the cell membrane after activation of ion channels by a depolarizing voltage command. The opened channels require a finite time to close or "deactivate" upon repolarization, and the current that flows through the channels during this period is referred to as a tail current. Because Bay K 8644 prolongs opening events in calcium channels, it tends to prolong these tail currents and make them more pronounced.

EXAMPLES

The following examples are included for illustrative purposes only and are not intended to limit the scope of the invention.

EXAMPLE I PREPARATION OF LIBRARIES USED FOR ISOLATION OF DNA ENCODING HUMAN NEURONAL VOLTAGE-DEPENDENT CALCIUM CHANNEL SUBUNITS

A. RNA Isolation

1. IMR32 cells

IMR32 cells were obtained from the American Type Culture Collection (ATCC Accession No. CCL127, Rockville, Md.) and grown in DMEM, 10% fetal bovine serum, 1% penicillin/streptomycin (GIBCO, Grand Island, N.Y.) plus 1.0 mM dibutyryl cAMP (dbcAMP) for ten days. Total RNA was isolated from the cells according to the procedure described by H. C. Birnboim (1988) Nucleic Acids Research 16:1487-1497!. Poly(A⁺) RNA was selected according to standard procedures see, e.g., Sambrook et al. (1989) is Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory Press; pg. 7.26-7.29!.

2. Human thalamus tissue

Human thalamus tissue. (2.34 g), obtained from the National Neurological Research Bank, Los Angeles, Calif., that had been stored frozen at -70° C. was pulverized using a mortar and pestle in the presence of liquid nitrogen and the cells were lysed in 12 ml of lysis buffer (5M guanidinium isothiocyanate, 50 mM TRIS, pH 7.4, 10 mM EDTA, 5% β-mercaptoethanol). Lysis buffer was added to the lysate to yield a final volume of 17 ml. N-laurylsarcosine and CsCl were added to the mixture to yield final concentrations of 4% and 0.01 g/ml, respectively, in a final volume of 18 ml.

The sample was centrifuged at 9,000 rpm in a Sorvall SS34 rotor for 10 min at room temperature to remove the insoluble material as a pellet. The supernatant was divided into two equal portions and each was layered onto a 2-ml cushion of a solution of 5.7M CsCl, 0.1M EDTA contained in separate centrifuge tubes to yield approximately 9 ml per tube. The samples were centrifuged in an SW41 rotor at 37,000 rpm for 24 h at 20° C.

After centrifugation, each RNA pellet was resuspended in 3 ml ETS (10 mM TRIS, pH 7.4, 10 mM EDTA, 0.2% SDS) and combined into a single tube. The RNA was precipitated with 0.25M NaCl and two volumes of 95% ethanol.

The precipitate was collected by centrifugation and resuspended in 4 ml PK buffer (0.05M TRIS, pH 8.4, 0.14M NaCl, 0.01M EDTA, 1% SDS). Proteinase K was added to the sample to a final concentration of 200 μg/ml. The sample was incubated at 22° C. for 1 h, followed by extraction with an equal volumes of phenol:chloroform:isoamylalcohol (50:48:2) two times, followed by one extraction with an equal volume of chloroform: isoamylalcohol (24:1). The RNA was precipitated with ethanol and NaCl. The precipitate was resuspended in 400 μl of ETS buffer. The yield of total RNA was approximately 1.0 mg. Poly A⁺ RNA (3C μg) was isolated from the total RNA according to standard methods as stated in Example I.A.1.

B. Library construction

Double-stranded cDNA was synthesized according to standard methods see, e.g., Sambrook et al. (1989) IN: Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory Press, Chapter 8!. Each library was prepared in substantially the same manner except for differences in: 1) the oligonucleotide used to prime the first strand cDNA synthesis, 2) the adapters that were attached to the double-stranded cDNA, 3) the method used to remove the free or unused adapters, and 4) the size of the fractionated cDNA ligated into the λ phage vector.

1. IMR32 cDNA library #1

Single-stranded cDNA was synthesized using IMR32 poly(A⁺) RNA (Example I.A.1.) as a template and was primed using oligo (dT)₁₂₋₁₈ (Collaborative Research Inc., Bedford, Mass.). The single-stranded cDNA was converted to double-stranded cDNA and the yield was approximately 2μg. Ecol adapters:

    5'-AATTCGGTACGTACACTCGAGc-3'-22-mer (SEQ ID No.15)

    3'- GCCATGCATGTGAGCTCG-5'-18-mer (SEQ ID No.16)

also containing SnaBI and XhoI restriction sites were then added to the double-stranded cDNA according to the following procedure.

a. Phosphorylation of 18-mer

The 18-mer was phosphorylated using standard methods see, e.g., Sambrook et al. (1989) IN: Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory Press, Chapter 8! by combining in a 10 μl total volume the 18 mer (225 pmoles) with ³² P!γ-ATP (7000 Ci/mmole; 1.0 μl) and kinase (2 U) and incubating at 37° C. for 15 minutes. After incubation, 1 μL 10 mM ATP and an additional 2 U of kinase were added and incubated at 37° C. for 15 minutes. Kinase was then inactivated by boiling for 10 minutes.

b. Hybridization of 22-mer

The 22-mer was hybridized to the phosphorylated 18-mer by addition of 225 pmoles of the 22-mer (plus water to bring volume to 15 μl), and incubation at 65° C. for 5 minutes. The reaction was then allowed to slow cool to room temperature.

The adapters were thus present at a concentration of 15 pmoles/μl, and were ready for cDNA-adapter ligation.

c. Ligation of adapters to cDNA

After the EcoRI, SnaBI, XhoI adapters were ligated to the double-stranded cDNA using a standard protocol see, e.g., Sambrook, et al. (1989) IN: Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory Press, Chapter 8!, the ligase was inactivated by heating the mixture to 72° C. for 15 minutes. The following reagents were added to the cDNA ligation reaction and heated at 37° C. for 30 minutes: cDNA ligation reaction (20 μl), water (24 μl) lox kinase buffer (3 μl), 10 mM ATP (1 μl) and kinase (2 μl of 2 μl). The reaction was stopped by the addition of 2 μl 0.5M EDTA, followed by one phenol/chloroform extraction and one chloroform extraction.

A. Size selection and Packaging of cDNA

The double-stranded cDNA with the EcoRI, SnaBI, XhoI adapters ligated was purified away from the free or unligated adapters using a 5 ml Sepharose CL-4B column. (Sigma, St. Louis, Mo.). 100 μl fractions were collected and those containing the cDNA, determined by monitoring the radioactivity, were pooled, ethanol precipitated, resuspended in TE buffer and loaded onto a 1% agarose gel. After the electrophoresis, the gel was stained with ethidium bromide and the 1 to 3 kb fraction was cut from the gel. The cDNA embedded in the agarose was eluted using the "Geneluter Electroelution System" (Invitrogen, San Diego, Calif.). The eluted cDNA was collected by ethanol precipitation and resuspended in TE buffer at 0.10 pmol/μl. The cDNA was ligated to 1 μg of EcoRI digested, dephosphorylated λgt11 in a 5 μl reaction volume at a 2- to 4- fold molar excess ratio of cDNA over the λgt11 vector. The ligated λgt11 containing the cDNA insert was packaged into λ phage virions in vitro using the Gigapack (Stratagene, La Jolla, Calif.) kit. The packaged phage were plated on an E. coli Y1088 bacterial lawn in preparation for screening.

2. IMR32 cDNA library #2

This library was prepared as described (Example I.B.1.) with the exception that 3 to 9 kb cDNA fragments were ligated into the λgt11 phage vector rather than the 1 to 3 kb fragments.

3. IMR32 cDNA library #3

IMR32 cell poly(A⁺) RNA (Example I.A.1.) was used as a template to synthesize single-stranded cDNA. The primers for the first strand cDNA synthesis were random primers (hexadeoxy-nucleotides pd(N)₆ ! Cat #5020-1, Clontech, Palo Alto, Calif.). The double-stranded cDNA was synthesized (Example 1.B.1.), EcoRI, SnaBI, XhoI adapters were added to the cDNA (Example I.B.1.), the unligated adapters were removed (Example I.B.1.), and the double-stranded cDNA with the ligated adapters was fractionated on an agarose gel (Example I.B.1.). The cDNA fraction greater than 1.8 kb was eluted from the agarose (Example I.B.1.), ligated into λgt11, packaged, and plated into a bacterial lawn of Y1088 (Example I.B.1.).

4. IMR32 cDNA library #4

IMR32 cell poly(A+) RNA (Example I.A.1.) was used as a template to synthesize single-stranded cDNA. The primers for the first strand cDNA synthesis were oligonucleotides: 89-365a specific for the α_(1D) (VDCC III) type α₁ -subunit (see Example II.A.) coding sequence (the complementary sequence of nt 292, to 2956, SEQ ID No. 1), 89-495 specific for the α_(1C) (VDCC II) type α₁ -subunit (see Example II.B.) coding sequence (the complementary sequence of nt 852 to 873, SEQ ID No. 3), and 90-12 specific for the α_(1C) -subunit coding sequence (the complementary sequence of nt 24.96 to 2520, SEQ ID No. 3). The cDNA library was then constructed as described (Example I.B.3), except that the cDNA size-fraction greater than 1.5 kb was eluted from the agarose rather than the greater than 1.8 kb fraction.

5. IMR32 cDNA library #5

The cDNA library was constructed as described (Example I.B.3.) with the exception that the size-fraction greater than 1.2 kb was eluted from the agarose rather than the greater than 1.8 kb fraction.

6. Human thalamus cDNA library #6

Human thalamus poly (A⁺) RNA (Example I.A.2.) was used as a template to synthesize single-stranded cDNA. Oligo (dT) was used to prime the first strand synthesis (Example I.B.1.). The double-stranded cDNA was synthesized (Example I.B.1.) and EcoRI, KpnI, NcoI adapters of the following sequence:

    5'CCATGGTACCTTCGTTGACG 3'=20 mer (SEQ ID NO. 17)

    3'GGTACCATGGAAGCAACTGCTTAA 5'=24 mer (SEQ ID No. 18)

were ligated to the double-stranded cDNA as described (Example I.B.1.) with the 20-mer replacing the 18-mer and the 24-mer replacing the 22-mer. The unligated adapters were removed by passing the cDNA-adapter mixture through a 1 ml Bio Gel A-50 (Bio-Rad Laboratories, Richmond, Calif.) column. Fractions (30 μl) were collected and 1 μl of each fraction in the first peak of radioactivity was electrophoresed on a 1% agarose gel. After electrophoresis, the gel was dried on a vacuum gel drier and exposed to x-ray film. The fractions containing cDNA fragments greater than 600 bp were pooled, ethanol precipitated, and ligated into λgt11 (Example I.B.1.). The construction of the cDNA library was completed as described (Example I.B.1.).

C. Hybridization and washing Conditions

Hybridization of radiolabelled nucleic acids to immobilized DNA for the purpose of screening cDNA libraries, DNA Southern transfers, or northern transfers was routinely performed in standard hybridization conditions (5 x SSPE, 5x Denhardt's, 50% deionized formamide, 200 μg/ml sonicated herring sperm DNA (Cat 0223646, Boehringer Mannheim Biochemicals, Indianapolis, Ind.)!. The recipes for SSPE and Denhardt's and the preparation of deionized formamide are described, for example, in Sambrook et al. (1989) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory Press, Chapter 8). In some hybridizations, lower stringency conditions were used in that 10% deionized formamide replaced 50% deionized formamide described for the standard hybridization conditions.

The washing conditions for removing the non-specific probe from the filters was either high, medium, or low stringency as described below:

1) high stringency: 0.1 x SSPE, 0.1% SDS, 65° C.

2) medium stringency: 0.2 x SSPE, 0.1% SDS, 50° C.

3) low stringency: 1.0 x SSPE, 0.1% SDS, 50° C.

It is understood that equivalent stringencies may be achieved using alternative buffers, salts and temperatures.

EXAMPLE II ISOLATION OF DNA ENCODING THE HUMAN NEURONAL CALCIUM CHANNEL α₁ SUBUNIT

A. Isolation of DNA encoding the α_(ID) subunit

1. Reference list of partial α_(ID) cDNA clones

Numerous α_(ID) -specific cDNA clones were isolated in order to characterize the complete α_(ID) coding sequence plus portions of the 5' and 3' untranslated sequences. SEQ ID No. 1 shows the complete α_(1D) DNA coding sequence, plus 510 nucleotides of α_(ID) 5' untranslated sequence ending in the quanidine nucleotide adjacent to the adenine nucleotide of the proposed initiation of translation as well as 642 nucleotides of 3' untranslated sequence. Also shown in SEQ ID No. 1 is the deduced amino acid sequence, A list of partial cDNA clones used to characterize the α_(ID) sequence and the nucleotide position of each clone relative to the full-length α_(ID) cDNA sequence, which is set forth in SEQ ID No. 1, is shown below. The isolation and characterization of these clones are described below (Example II.A.2.).

    ______________________________________                                         IMR32    1.144   nt. 1 to 510 of SEQ ID No. 1                                                   5' untranslated sequence,                                                      nt. 511 to 2431,                                                                               SEQ ID No. 1                                  IMR32*   1.136   nt. 1627 to 2988,                                                                              SEQ ID No. 1                                                   nt. 1 to 104 of SEQ ID No. 2                                                   additional exon,                                              IMR32@   1.80    nt. 2083 to 6468,                                                                              SEQ ID No. 1                                  IMR32.sup.#                                                                             1.36    nt. 2857 to 4281,                                                                              SEQ ID No. 1                                  IMR32    1.163   nt. 5200 to 7635,                                                                              SEQ ID No. 1                                  ______________________________________                                          *5' of nt 1627, IMR32 1.136 encodes an intron and an additional exon           described in Example II.A.2.d.                                                 @IMR32 1.80 contains two deletions, nt 2984 to 3131 and nt 5303 to 5349        (SEQ ID No. 1). The 148 nt deletion (nt. 2984 to 3131) was corrected by        performing a polymerase chain reaction described in Example II.A.3.b.          .sup.# IMR32 1.36 contains a 132 nt deletion (nt. 3081 to 3212).         

2. Isolation and characterization of individual clones listed in Example II.A.1.

a. IMR32 1.36

Two million recombinants of the IMR32 cDNA library #1 (Example I.B.1.) were screened in duplicate at a density of approximately 200,000 plaques per 150 mm plate using a mixture of radiolabelled fragments of the coding region of the rabbit skeletal muscle calcium channel α₁ cDNA for the sequence of the rabbit skeletal muscle calcium channel α₁ subunit cDNA, see, Tanabe et al. (1987). Nature 328:313-318!:

    ______________________________________                                         Fragment       Nuoleotides                                                     ______________________________________                                         KpnI--EcoRI     -78 to 1006                                                    EcoRI--XhoI    1006 to 2653                                                    ApaI--ApaI     3093 to 4182                                                    BglII--SacI    4487 to 5310                                                    ______________________________________                                    

The hybridization was performed using low stringency hybridization conditions (Example I.C.) and the filters were washed under low stringency (Example I.C.). Only one α_(1D) -specific recombinant (IMR32 1.36) of the 2×10⁶ screened was identified. IMR32 1.36 was plaque purified by standard methods (J. Sambrook et al. (1989) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory Press, Chapter 8) subcloned into pGEM3 (Promega, Madison, Wis.) and characterized by DNA sequencing.

b. IMR32 1.80

Approximately 1×10⁶ recombinants of the IMR32 cDNA library 12 (Example I.B.2.) were screened in duplicate At a density of approximately 100,000 plaques per 150 mm plate using the IMR32 1.36 cDNA fragment (Example II.A.1) as a probe. Standard hybridization conditions were used (Example I.C), and the filters were washed under high stringency (Example I.C.). Three positive plaques were identified one of which was IMR32 1.80. IMR32 1.80 was plaque purified by standard methods, restriction mapped, subcloned, and characterized by DNA sequencing.

C. IXR32 1.144

Approximately 1×10⁵ recombinants of the IMR32 cDNA library #3 (Example I.B.3) were screened with the EcoRI-PvuII fragment (nt 2083 to 2518, SEQ ID No. 1) of IMR32 1.80. The hybridization was performed using standard hybridization conditions (Example I.C.) and the filters were washed under high stringency (Example I.C.). Three positive plaques were identified one of which was IMR32 1.144. IM32 1.144 was plaque purified, restriction mapped, and the cDNA insert was subcloned into pGEM7Z (Promega, Madison, Wis.) and characterized by DNA sequencing. This characterization revealed that IMR32 1.144 has a series of ATG codons encoding seven possible initiating methionines (nt 511 to 531, SEQ ID No: 1). PCR analysis, and DNA sequencing of cloned PCR products encoding these seven ATG codons confirmed that this sequence is present in the α_(1D) transcript expressed in dbcAMP-induced IMR32 cells.

d. IMR32 1.136

Approximately 1×10⁶ recombinants of the IMR32 cDNA library #4 (Example I.B.4) were screened with the EcoRI-PvuII fragment (nt 2083 to 2518, SEQ ID No. 1) of IMR32 1.80 (Example II.A.1.). The hybridization was performed using standard hybridization conditions (Example I.C.) and the filters were washed under high stringency (Example I.C.). Six positive plaques were identified one of which was IMR32 1.136. IMR32 1.136 was plaque purified, restriction mapped, and the cDNA insert was subcloned into a standard plasmid vector, pSP72 (Promega, Madison, Wis.), and characterized by DNA sequencing. This characterization revealed that IMR32 1.136 encodes an incompletely spliced α_(1D), transcript. The clone contains nucleotides 1627 to 2988 of SEQ ID No. 1 preceded by an approximate 640 bp intron. This intron is then preceded by a 104 nt ekon (SEQ ID No. 2), which is an alternative exon encoding the IS6 transmembrane domain see, e.g., Tanabe et al. (1987) Nature 328:313-318 for a description of the IS1 to IVS6 transmembrane terminology! of the α_(ID) subunit and can replace nt 1627 to 1730, SEQ ID No. 1, to produce a completely spliced α_(ID) transcript.

e. IMR32 1.163

Approximately 1×10⁶ recombinants of the IMR32 cDNA library #3 (Example I.B.3.) were screened with the NcoI-XhoI fragment of IMR32 1.80 (Example II.A.1.) containing nt 5811 to 6468 (SEQ ID No. 1). The hybridization was performed using standard hybridization conditions (Example I.C.) and the filters were washed under high stringency (Example I.C.). Three positive plaques were identified one of which was IMR32 1.163. IMR32 1.163 was plaque purified, restriction mapped, and the cDNA insert was subcloned into a standard plasmid vector, pSP72 (Promega, Madison, Wis.), and characterized by DNA sequencing. This characterization revealed that IMR32 1.163 contains the α_(ID) termination codon, nt 6994 to 6996 (SEQ ID No. 1).

3. Construction of a full-length α_(ID) cDNA pVDCCIII(A)!

α_(ID) cDNA clones IMR32 1.144, IMR32 1.136, IMR32 1.80, and IMR32 1.163 (Example II.A.2.) overlap and include the entire α_(ID) coding sequence, nt 511 to 6993 (SEQ ID No. 1), with the exception of a 148 bp deletion, nt 2984 to 3131 (SEQ ID No. 1). Portions of these partial cDNA clones were ligated to generate a full-length α_(ID) cDNA in a eukaryotic expression vector. The resulting vector was called pVDCCIII(A). The construction of PVDCOIII(A) was performed in four steps described in detail below: (1) the construction of pVDCCIII/5' using portions of IMR32 1.144, IMR32 1.136, and IMR32 1.80, (2) the construction of pVDCCIII/5'.3 that corrects the 148 nt deletion in the IMR32 1.80 portion of pVDCCIII/5', (3) the construction of pVDCCIII/3'.1 using portions of IMR32 1.80 and IMR32 1.163, and (4) the ligation of a portion of the pVDCCIII/5'.3 insert, the insert of pVDCCIII/3'.1, and pcDNA1 (Invitrogen, San Diego, Calif.) to form pVDCCIII(A). The vector pcDNA1 is a eukaryotic expression vector containing a cytomegalovirus (CMV) promoter which is a constitutive promoter recognized by mammalian host cell RNA polymerase II.

Each of the DNA fragments used in preparing the full-length construct was purified by electrophoresis through an agarose gel onto DE81 filter paper (Whatoan, Clifton, N.J.) and elution from the filter paper using 1.0M. NaCl, 10 mM TRIS, pH 8.0, 1 mM EDTA. The ligations typically were performed in a 10 μl reaction volume with an equal molar ratio of insert fragment and a two-fold molar excess of the total insert relative to the vector. The amount of DNA used was normally about 50 ng to 100 ng.

a. pVDCCIII/5'

To construct pVDCCIII/5', IMR32 1.144 (Example II.A.2.c.) was digested with XhoI and EcoRI and the fragment containing the vector (pGEM7Z), α_(1D) nt 1 to 510 (SEQ ID No. 1), and α_(1D) nt 511 to 1732 (SEQ ID No. 1) was isolated by gel electrophoresis. The EcoRI-ApaI fragment of IMR32 1.136 (Example II.A.2.d.) nucleotides 1732 to 2667 (SEQ ID No. 1) was isolated, and the ApaI-HindIII fragment of IMR32 1.80 (Example II.A.2.b.), nucleotides 2667 to 4492 (SEQ ID No. 1) was isolated. The three DNA clones were ligated to form pVDCCIII/5' containing nt 1 to 510 (5' untranslated sequence; SEQ ID No. 1) and nt 511 to 4492 (SEQ ID No. 1).

b. pVDCCIII/5'.3

Comparison of the IMR32 1.36 and IMR32 1.80 DNA sequences revealed that these two cDNA clones differ through the AID coding sequence, nucleotides 2984 to 3212. PCR analysis of IMR32 1.80 and dbcAMP-induced,(1.0 mM, 10 days) IMR32 cytoplasmic RNA (isolated according to Ausubel, F. M. et al. (Eds) (1988) Current Protocols in Molecular Biology, John Wiley and Sons, New York) revealed that IMR32 1.80 had a 148 nt deletion, nt 2984 to 3131 (SEQ ID No. 1), and that IMR32 1.36 had a 132 nt deletion, nt 3081 to 3212. To perform the PCR analysis, amplification was primed with α_(1D) -specific oligonucleotides 112 (nt 2548 to 2572, SEQ ID No. 1) and 311 (the complementary sequence of nt 3928 to 3957, SEQ ID No. 1). These products were then reamplified using α_(1D) -specific oligonucleotides 310 (nt 2583 to 2600 SEQ ID No. 1) and 312 (the complementary sequence of nt 3883 to 3909). This reamplified product, which contains AccI and BglII restriction sites, was digested with AccI and BglII and the AccI-BglII fragment, nt 2764 to 3890 (SEQ ID No. 1) was cloned into AccI-BgIII digested pVDCCIII/5' to replace the AccI-BglII pVDCCIII/5' fragment that had the deletion. This new construct was named pVDCCIII/5'.3. DNA sequence determination of pVDCCIII/5'.3 through the amplified region confirmed the 148 nt deletion in IMR32 1.80.

C. pVDCCIII/3'.1

To construct pVDCCIII/3'.1, the cDNA insert of IMR32 1.163 (Example II.A.2.e.) was subcloned into pBluescript II (Stratagene, La Jolla, Calif.) as an XhoI fragment. The XhoI sites on the cDNA fragment were furnished by the adapters used to construct the cDNA library (I.B.3.). The insert was oriented such that the translational orientation of the insert of IMR32 1.163 was opposite to that of the lacZ gene present in the plasmid, as confirmed by analysis of restriction enzyme digests of the resulting plasmid. This was done to preclude the possibility of expression of α_(1D) sequences in DH5α cells transformed with this plasmid due to fusion with the lacz gene. This plasmid was then digested with HindlIl and BglII and the HindIII--BglII fragment (the HindIII site comes from the vector and the BglII site is at nt 6220, SEQ ID No. 1) was eliminated, thus deleting nt 5200 to 6220 (SEQ ID No. 1) of the IMR32 1.163 clone and removing this sequence from the remainder of the plasmid which contained the 3' BglII--XhoI fragment, nt 6220 to 7635 (SEQ ID No. 1). pVDCCIII/3'.1 was then made by splicing together the HindIII-PvuII fragment from IMR32 1.80 (nucleotides 4492-5294, SEQ ID No. 1), the PvuII--BgII fragment of IMR32 1.163 (nucleotides 5294 to 6220, SEQ TD No. 1) and the HindIII-BglII-digested pBluescript plasmid containing the 3' BglII/Xho3 IMR32 1.163 fragment (nt 6220 to 7635, SEQ ID No. 1).

d. pVDCCIII(A): the full-length α_(1D) construct

To construct pVDCCIII(A), the DraI-HindIII fragment (5' untranslated sequence nt 327 to 510, SEQ ID No. 1 and, coding sequence nt 511 to 4492, SEQ ID, No. 1) of pVDCCIII/5'.3 (Example II.A.3.b.) was isolated; the HindIII-XhoI fragment of pVDCCIII/3'.1 (containing nt 4492 to 7635, SEQ ID No. 1, plus the XhoI site of the adapter) (Example II.A.3.c.) was isolated; and the plasmid vector, pcDNA1, was digested with EcoRV and XhoI and isolated on an agarose gel. The three DNA fragments were ligated and MC1061-P3 (Invitrogen, San Diego, Calif.) was transformed. Isolated clones were analyzed by restriction mapping and DNA sequencing and pVDCCIII(A) was identified which had the fragments correctly ligated together: DraI-HindIII, HindIII-XhoI, XhoI-EcoRV with the blunt-end DraI and EcoRV site ligating together to form the circular plasmid.

The amino-terminus of the α_(1D) subunit is encoded by the seven consecutive 5' methionine codons (nt 511 to 531, SEQ ID No. 1). This 5' portion plus nt 532 to 537, encoding two lysine residues, were deleted from pVDCCIII(A) and replaced with an efficient ribosomal binding site (5'-ACCACC-3') to form pVDCCIII.RBS (A). Expression experiments in which transcripts of this construct where injected into Xenopus laevis oocytes did not result in an enhancement in the recombinant voltage-dependent calcium channel expression level relative to the level of expression in oocytes injected with transcripts of pVDCCIII(A).

B. Isolation of DNA encoding the α_(1C) subunit

1. Reference List of Partial α_(1C) cDNA clones

Numerous α_(1C) -specific cDNA clones were isolated in order to characterize the α_(1C) coding sequence, the α_(1C) initiation of translation, and an alternatively spliced region of α_(1C). SEQ ID No. 3 sets forth the characterized α_(1C) coding sequence nt 1 to 5904) and deduced amino acid sequence. SEQ ID No. 4 and No. 5 encode two possible amino terminal ends of the α_(1C) protein. SEQ ID No. 6 encodes an alternative exon for the IV S3 transmembrane domain. Shown below is a list of clones used to characterize the α_(1C) sequence and the nucleotide position of each clone relative to the characterized α_(1C) sequence (SEQ ID No. 3). The isolation and characterization of these cDNA clones are described below (Example II.B.2).

    ______________________________________                                         IMR32      1.66      nt 1 to 916, SEQ ID No. 3                                                      nt 1 to 132, SEQ ID No. 4                                 IMR32      1.157     nt 1 to 873, SEQ ID No. 3                                                      nt 1 to 89, SEQ ID No. 5                                  IMR32      1.67      nt 50 to 1717, SEQ ID No. 3                               *IMR32     1.86      nt 1366 to 2583, SEQ ID No. 3                             *1.16G               nt 758 to 867, SEQ ID No. 3                               IMR32      1.37      nt 2804 to 5904, SEQ ID No. 3                             CNS        1.30      nt 2199 to 3903, SEQ ID No. 3                                                  nt 1 to 84 of alternative exon,                                                SEQ ID No. 6                                              IMR32      1.38      nt 2448 to 4702, SEQ ID No. 3                                                  nt 1 to 84 of alternative exon,                                                SEQ ID No. 6                                              ______________________________________                                          *IMR32 1.86 has a 73 nt deletion compared to the rabbit cardiac muscle         calcium channel α.sub.1 subunit cDNA sequence.                           *1.16G is an α.sub.1 C genomic clone.                              

2. Isolation and characterization of clones described in Example 11.8.1.

a. CNS 1.30

Approximately 1×10⁶ recombinants of the human thalamus cDNA library No. 6 (Example I.B.6.) were screened with fragments of the rabbit skeletal muscle calcium channel α₁ cDNA described in. Example II.A.2.a. The hybridization was performed using standard hybridization conditions (Example I.C.) and the filters were washed under low stringency (Example I.C.). Six positive plaques were identified, one of which was CNS 1.30. CNS 1.30 was plaque purified, restriction mapped, subcloned, and characterized by DNA sequencing. CNS 1.30 encodes α_(1C) -specific sequence nt 2199 to 3903 (SEQ ID No. 3) followed by nt 1 to 84 of one of two identified alternative α_(1C) exons (SEQ ID No. 6). 3' of SEQ ID No. 6, CNS 1.30 contains an intron and, thus, CNS 1.30 encodes a partially spliced α_(1C) transcript.

b. 1.16G

Approximately 1×10⁶ recombinants of a λEMBL3-based human genomic DNA library (Cat # HL1006d Clontech Corp., Palo Alto, Calif.) were screened using a rabbit skeletal muscle cDNA fragment (nt -78 to 1006, Example II.A.2.a.). The hybridization was performed using standard hybridization conditions (Example I.C.) and the filters were washed under low stringency (Example I.C.). Fourteen positive plaques were identified, one of which was 1.16G. Clone 1.16G was plaque purified, restriction mapped, subcloned, and portions were characterized by DNA sequencing. DNA sequencing revealed that 1.16G encodes α_(1C) -specific sequence as described in Example II.B.1.

c. IMR32 1.66 and IMR32 1.67

Approximately 1×10⁶ recombinants of IR32 cDNA library #5 (Example I.B.S.) were screened with a 151 bp KpnI-SacI fragment of 1.16G (Example II.B.2.b.) encoding tic sequence (nt 758 to 867, SEQ ID No. 3). The hybridization was performed using standard hybridization conditions (Example I.C.). The filters were then washed in 0.5 x SSPE at 65° C. Of the positive plaques, IMR32 1.66 and IMR32 1.67 were identified. The hybridizing plaques, were purified, restriction mapped, subcloned, and characterized by DNA sequencing. Two of these cDNA clones, IMR32 1.66 and 1.67, encode α_(1C) subunits as described (Example II.B.1.). In addition, IMR32, 1.66 encodes a partially spliced α_(1C) transcript marked by a GT splice donor dinucleotide beginning at the nucleotide 3' of nt 916 (SEQ ID No. 3). The intron sequence within 1.66 is 101 nt long. IMR32 1.66 encodes the α_(1C) initiation of translation, nt 1 to 3 (SEQ ID No. 3) and 132 nt of 5' untranslated sequence (SEQ ID No. 4) precede the start codon in IMR32 1.66.

d. IMR32 1.37 and IMR32 1.38

Approximately 2×10⁶ recombinants of IMR32 cDNA library #1 (Example I.B.1.) were screened with the CNS 1.30 cDNA fragment (Example II.B.2.a.). The hybridization was performed using low stringency hybridization conditions (Example I.C.) and the filters were washed under low stringency (Example I.C.). Four positive plaques were identified, plaque purified, restriction mapped, subcloned, and characterized by DNA sequencing. Two of the clones, IMR32 1.37 and IMR32 1.38 encode α_(1C) -specific sequences as described in Example II.B.1.

DNA sequence comparison of IMR32 1.37 and IMR32 1.38 revealed that the α_(1C) transcript includes two exons that encode the IVS3 transmembrane domain. IMR32 1.37 has a single exon, nt 3904 to 3987 (SEQ ID No. 3) and IMR32 1.38 appears to be anomalously spliced to contain both exons juxtaposed, nt 3904 to 3987 (SEQ ID No. 3) followed by nt 1 to 84 (SEQ ID No. 6). The alternative splice of the α_(1C) transcript to contain either of the two exons encoding the IVS3 region was confirmed by comparing the CNS 1.30 sequence to the IMR32 1.37 sequence. CNS 1.30 contains nt 1 to 84 (SEQ ID No. 6) preceded by the identical sequence contained in IMR32 1.37 for nt 2199 to 3903 (SEQ ID No. 3). As described in Example II.B.2.a., an intron follows nt 1 to 84 (SEQ ID No. 6). Two alternative exons have been spliced adjacent to nt 3903 (SEQ ID No. 3) represented by CNS 1.30 and IMR32 1.37.

e. IMR32 1.86

IMR32 cDNA library #1 (Example I.B.1.) was screened in duplicate using oligonucleotide probes 90-9 (nt 1462 to 1491, SEQ ID No. 3) and 90-12 (nt 2496 to 2520, SEQ ID No. 3). These oligonucleotide probes were chosen in order to isolate a clone that encodes the α_(1C) subunit between the 3' end of IMR32 1.67 (nt 1717, SEQ ID No. 3) and the 5' end of CNS 1.30 (nt 2199, SEQ ID No. 3). The hybridization conditions were standard hybridization conditions (Example I.C.) with the exception that the 50% deionized formamide was reduced to 20%. The filters were washed under low stringency (Example I.C.). Three positive plaques were identified one of which was IMR32 1.86. IMR32 1.86 was plaque purified, subcloned, and characterized by restriction mapping and DNA sequencing. IMR32 1.86 encodes α_(1C) sequences as described in Example II.B.1. Characterization by DNA sequencing revealed that IMR32 1.86 contains a 73 nt deletion compared to the DNA encoding rabbit cardiac muscle calcium channel α₁ subunit Mikami et al. (1989) Nature 340:230!, nt 2191 to 2263. These missing nucleotides correspond to nt 2176-2248 of SEQ ID No. 3. Because the 5'-end of CNS 1.30 overlaps the 3'-end of IMR32 1.86, some of these missing nucleotides, i.e., nt 2205-2248 of SEQ ID No. 3, are accounted for by CNS 1.30. The remaining missing nucleotides of the 73 nucleotide deletion in IMR32 1.86 (i.e., nt 2176-220. SEQ ID No. 3) were determined by PCR analysis of dbcAMP-induced IMR32 cell RNA. The 73 nt deletion is a frame-shift mutation and, thus, needs to be corrected. The exact human sequence through this region, (which has been determined by the DNA sequence of CNS 1.30 and PCR analysis of IMR32 cell RNA) can be inserted into IMR32 1.86 by standard methods, e.g., replacement of a restriction fragment or site-directed mutagenesis.

f. IMR32 1.1S7

One million recombinants of IMR32 cDNA library #4 (Example I.B.4.) were screened with an XhoI-EcoRI fragment of IMR32 1.67 encoding α_(1C) nt 50 to 774 (SEQ ID No. 3). The hybridization was performed using standard hybridization conditions (Example I.C.). The filters were washed under high stringency (Example I.C.). One of the positive plaques identified was IMR32 1.157. This plaque was purified, the insert was restriction mapped and subcloned to a standard plasmid vector pGEM7Z (Promega, Madison, Wis.). The DNA was characterized by sequencing. IMR32 1.157 appears to encodes an alternative 5' portion of the α_(1C) sequence beginning with nt 1 to 89 (SEQ ID No. 5) and followed by nt 1 to 873 (SEQ ID No. 3). Analysis of the 1.66 and 1.157 5' sequence is described below (Example II.B.3.).

3. Characterization of the α_(1C) initiation of translation site

Portions of the sequences of IMR32 1.157 (nt 57 to 89, SEQ ID No. 5; nt 1 to 67, SEQ ID No. 3), IMR32 1.66 (nt 100 to 132, SEQ ID No. 4; nt 1 to 67, SEQ ID No. 3), were compared to he rabbit lung CaCB-receptor cDNA sequence, nt -33 to 67 Biel et al. (1990) FEBS Lett. 269:409!. The human sequences are possible alternative 5' ends of the α_(1C) transcript encoding the region of initiation of translation. IMR32 1.66 closely matches the CaCB receptor cDNA sequence and diverges from the CaCB receptor cDNA sequence in the 5' direction beginning at nt 122 (SEQ ID No. 4). The start codon identified in the CaCB receptor cDNA sequence is the same start codon used to describe the α_(1C) coding sequence, nt 1 to 3 (SEQ ID No. 3). The functional significance of the IMR32 1.157 sequence, nt 1 to 89 (SEQ ID No. 5), is not clear. Chimeras containing sequence between 1.157 and the α_(1C) coding sequence can be constructed and functional differences can be tested.

C. Isolation of partial cDNA clones encoding the α_(1B) subunit and construction of a full-length clone

A human basal ganglia cDNA library was screened with the rabbit skeletal muscle α₁ subunit cDNA fragments (see Example II.A.2.a for description of fragments) under low stringency conditions. One of the hybridizing clones was used to screen an IMR32 cell cDNA library to obtain additional partial α_(1B) cDNA clones, which were in turn used to further screen an IMR32 cell cDNA library for additional partial cDNA clones. One of the partial IMR32 α_(1B) clones was used to screen a human hippocampus library to obtain a partial α_(1B) clone encoding the 3' end of the α_(1B) coding sequence. The sequence of some of the regions of the partial cDNA clones was compared to the sequence of products of PCR analysis of IMR32 cell RNA to determine the accuracy of the cDNA sequences.

PCR analysis of IMR32 cell RNA and genomic DNA using oligonucleotide primers corresponding to sequences located 5' and 3' of the STOP codon of the DNA encoding the α_(1B) subunit revealed an alternatively spliced α_(1B) -encoding mRNA in IMR32 cells. This second mRNA product is the result of differential splicing of the α_(1B) subunit transcript to include another exon that is not present in the mRNA corresponding to the other 3' α_(1B) cDNA sequence that was initially isolated. To distinguish these splice variants of the α_(1B) subunit, the subunit encoded by a DNA sequence corresponding to the form containing the additional exon is referred to as α_(1B-1) (SEQ ID No. 7), whereas the subunit encoded by a DNA sequence corresponding to the form lacking the additional exon is referred to as α_(1B-2) (SEQ ID No. 8). The sequence of α_(1B-1) diverges from that of α_(1B-2) beginning at nt 6633 (SEQ ID No. 7). Following the sequence of the additional exon in α_(1B-1) (nt 6633-6819; SEQ ID No. 7), the α_(1B-1) and α_(1B-2) sequences are identical (i.e., nt 6820-7362 in SEQ ID No. 7 and nt 6633-7175 in SEQ ID No. 8). SEQ ID No. 7 and No. 8 set forth 143 nt of 5' untranslated sequence (nt 1-143) as well as 202 nt of 3' untranslated sequence (nt 7161-7362, SEQ ID No. 7) of the DNA encoding α_(1B-1) and 321 nt of 3' untranslated sequence (nt 6855-7175, SEQ ID No. 8) of the DNA encoding α_(1B-2).

PCR analysis of the IS6 region of the α_(1B) transcript revealed what appear to be additional splice variants based on multiple fragment sizes seen on an ethidium bromide-stained agarose gel containing the products of the PCR reaction.

A full-length α_(1B-1) cDNA clone designated pcDNA-α_(1B-1) was prepared in an eight-step process as follows.

STEP 1: The SacI restriction site of pGEM3 (Promega, Madison, Wis.) was destroyed by digestion at the SacI site, producing blunt ends by treatment with T4 DNA polymerase, and religation. The new vector was designated pGEMΔSac.

STEP 2: Fragment 1 (HindIII/KpnI; nt 2337 to 4303 of SEQ ID No. 7) was ligated into HindIII/KpnI digested pGEM3ΔSac to produce pα1.177HK.

STEP 3: Fragment 1 has a 2 nucleotide deletion (nt 3852 and 3853 of SEQ ID No. 7). The deletion was repaired by inserting a PCR fragment (fragment 2) of IMR32 RNA into pα1.177HK. Thus, fragment 2 (NarI/KpnI; nt 3828 to 4303 of SEQ ID No. 7) was inserted into NarI/XpnI digested pα1.177HK replacing the NarI/KpnI portion of fragment 1 and producing pα1.177HK/PCR.

STEP 4: Fragment 3 (XpnI/KpnI; nt,4303 to 5663 of SEQ ID No. 7) was ligated into KpnI digested pα1.177HK/PCR to produce pα1B5'K.

STEP 5: Fragment 4 (EcoRI/HindIII; EcoRI adaptor plus nt 1 to 2337 of SEQ ID No. 7) and fragment 5 (HindIII/XhoI fragment of pα1B5'K; nt 2337 to 5446 of SEQ ID No. 7) were ligated together into EcoRI/XhoI digested pcDNA1 (Invitrogen, San Diego, Calif.) to produce pα1B5'.

STEP 6: Fragment 6 (EcoRI/EcoRI; EcoRI adapters on both ends plus nt 5749 to 7362 of SEQ ID No. 7) was ligated into EcoRI digested pBluescript II KS (Stratagene, La Jolla, Calif.) with the 5' end of the fragment proximal to the KpnI site in the polylinker to produce pα1.230.

STEP 7: Fragment 7 (XpnI/XhoI; nt 4303 to 5446 of SEQ ID No. 7), and fragment 8 (XhoI/CspI, nt 5446 to 6259 of SEQ ID No. 7) were ligated into XpnI/CspI digested pal.230 (removes nt 5749 to 6259 of SEQ ID No. 7 that was encoded in pα1.230 and maintains nt 6259 to 7362 of SEQ ID No. 7) to produce pα1B3'.

STEP 8: Fragment 9 (SphI/XhoI; nt 4993 to 5446 of SEQ ID No. 7) and fragment 10 (XhoI/XbaI of pα1B3'; nt 5446 to 7319 of SEQ ID No. 7) were ligated into SphI/XbaI digested pα1B5' (removes nt 4993 to 5446 of SEQ ID No. 7 that were encoded in pα1B5' and maintains nt 1 to 4850 of SEQ ID No. 7) to produce pcDNAα_(1B-1).

The resulting construct, pcDNAα_(1B-1), contains, in pcDNA1, a full-length coding region encoding α_(1B-1) (nt 144-7362, SEQ ID No. 7), plus 5' untranslated sequence (nt 1-143, SEQ ID No. 7) and 3' untranslated sequence (nt 7161-7319, SEQ ID No. 7) under the transcriptional control of the CMV promoter.

EXAMPLE III ISOLATION OF cDNA CLONES ENCODING THE HUMAN NEURONAL CALCIUM CHANNEL β₁ subunit

A. Isolation of partial cDNA clones encoding the β subunit and construction of a full-length clone encoding the β₁ subunit

A human hippocampus cDNA library was screened with the rabbit skeletal muscle calcium channel β₁ subunit cDNA fragment (nt 441 to 1379) for isolation and sequence of the rabbit skeletal muscle calcium channel β₁ subunit cDNA, see U.S. patent application Ser. No. 482,384 or Ruth et al. (1989) Science 245:1115! using standard hybridization conditions (Example I.C.). A portion of one of the hybridizing clones was used to rescreen the hippocampus library to obtain additional cDNA clones. The cDNA inserts of hybridizing clones were characterized by restriction mapping and DNA sequencing and compared to the rabbit skeletal muscle calcium channel β₁ subunit cDNA sequence.

Portions of the partial β₁ subunit cDNA clones were ligated to generate a full-length clone encoding the entire β₁ subunit. SEQ ID No. 9 shows the β₁ subunit coding sequence (nt 1-1434) as well as a portion of the 3' untranslated sequence (nt 1435-1546). The deduced amino acid sequence is also provided in SEQ ID No. 9. In order to perform expression experiments, full-length β₁ subunit cDNA clones were constructed as follows.

Step 1: DNA fragment 1 (˜800 bp of 5' untranslated sequence plus nt 1-277 of SEQ ID No. 9) was ligated to DNA fragment 2 (nt 277-1546 of SEQ ID No. 9 plus 448 bp of intron sequence) and cloned into pGEM7Z. The resulting plasmid, pβ1-1.18, contained a full-length β₁ subunit clone that included a 448-bp intron.

Step 2: To replace the 5' untranslated sequence of pβ1-1.18 with a ribosome binding site, a double-stranded adapter was synthesized that contains an EcoRI site, sequence encoding a ribosome binding site (5'-ACCACC-3') and nt 1-25 of SEQ ID No. 9. The adapter was ligated to Smal-digested pβ1-1.18, and the products of the ligation reaction were digested with EcoRI.

Step 3: The EcoRI fragment from step 2 containing the EcoRI adapter, efficient ribosome binding site and nt 1-1546 of SEQ ID No. 9 plus intron sequence was cloned into a plasmid vector and designated pβ1-1.18RBS. The EcoRI fragment of pβ1-1.18RBS was subcloned into EcoRI-digested pcDNA1 with the initiation codon proximal to CMV promoter to form pHBCaCHβ_(1a) RBS(A).

Step 4: To generate a full-length clone encoding the β₁ subunit lacking intron sequence, DNA fragment 3 (nt 69-1146 of, SEQ ID No. 9 plus 448 bp of intron sequence followed by nt 1147-1546 of SEQ ID No. 9), was subjected to site-directed mutagenesis to delete the intron sequence, thereby yielding pα1(-). The EcoRI-XhoI fragment of pβ1-1.18RBS (containing of the ribosome binding site and nt 1-277 of SEQ ID No. 9) was ligated to the XhoI-EcoRI fragment of pβ1(-) (containing of nt 277-1546 of SEQ ID No. 9) and cloned into pcDNA1 with the initiation of translation proximal to the CMV promoter. The resulting Expression plasmid was designated pHBCaCHβ_(1b) RBS(A).

B. Splice variant β₁₋₃

DNA sequence analysis of the DNA clones encoding the β₁ subunit indicated that in the CNS at least two alternatively spliced forms of the same human β₁ subunit primary transcript are expressed. One form is represented by the sequence shown in SEQ ID No. 9 and is referred to as β₁₋₂. The sequences of β1-2 and the alternative form, β₁₋₃, diverge at nt 1334 (SEQ ID No. 9). The complete β₁₋₃ sequence (nt 1-1851), including 3' untranslated sequence (nt 1795-1851), is set forth in SEQ ID No. 10.

EXAMPLE IV ISOLATION OF cDNA CLONES ENCODING THE HUMAN NEURONAL CALCIUM CHANNEL α₂ -subunit

A. Isolation of cDNA clones

The complete human neuronal α₂ coding sequence (nt 35-3307) plus a portion of the 5' untranslated sequence (nt 1 to 34) as well as a portion of the 3' untranslated sequence (nt 3308-3600) is set forth in SEQ ID No. 11.

To isolate DNA encoding the human neuronal α₂ subunit, human α2 genomic clones first were isolated by probing human genomic Southern blots using a rabbit skeletal muscle calcium channel α₂ subunit cDNA fragment nt 43 to 272, Ellis et al. (1988) Science 240:1661!. Human genomic DNA was digested with EcoRI, electrophoresed, blotted, and probed with the rabbit skeletal muscle probe using standard hybridization conditions (Example I.C.) and low stringency washing conditions (Example I.C.). Two restriction fragments were identified, 3.5 kb and 3.0 kb. These EcoRI restriction fragments were cloned by preparing a λgt11 library containing human genomic EcoRl fragments ranging from 2.2 kb to 4.3 kb. The library was screened as described above using the rabbit α₂ probe, hybridizing clones were isolated and characterized by DNA sequencing. HGCaCHα2.20 contained the 3.5 kb fragment and HGCaCHα2.9 contained the 3.0 kb fragment.

Restriction mapping and DNA sequencing revealed that HGCaCHα2.20 contains an 82 bp exon (nt 130 to 211 of the human α₂ coding sequence, SEQ ID No. 11) on a 650 bp Pstl-XbaI restriction fragment and that HGCaCα2.9 contains 105 bp of an exon (nt 212 to 316 of the coding sequence, SEQ ID No. 11) on a 750 bp XbaI-BglII restriction fragment. These restriction fragments were used to screen the human basal ganglia cDNA library (Example II.C.2.a.). HBCaCHα2.1 was isolated (nt 29 to 1163, SEQ ID No. 11) and used to screen a human brain stem cDNA library (ATCC Accession No. 37432) obtained from the American Type Culture Collection, 12301 Parklawn Drive, Rockville, Md. 20852. Two clones were isolated, HBCaCHα2.5 (nt 1 to 1162, SEQ ID No. 11) and HBCaCHβ2.8 (nt 714 to 1562, SEQ ID No. 11, followed by 1600 nt of intervening sequence). A 2400 bp fragment of HBCaCHα2.8 (beginning at nt 759 of SEQ ID No. 11 and ending at a SmaI site in the intron) was used to rescreen the brain stem library and to isolate HBCaCHα2.11 (nt 879 to 3600, SEQ ID No. 11). Clones HBCaCHα2.5 and HBCaCHα2.11 overlap to encode an entire human brain α₂ protein.

B. Construction of pHBCaCHα2

To construct pHBCaCHα2A containing DNA encoding a full-length human calcium channel α₂ subunit, an (EcoRI)-PvUII fragment of HBCaCHα2.5 (nt 1 to 1061, SEQ ID No. 11, EcoRI adapter, PvuII partial digest) and a PvuII-PstI fragment of HBCaCHα2.11 (nt 1061 to 2424 SEQ ID No. 11; PvuII partial digest) were ligated into EcoRI-PstI-digested pIBI24 (Stratagene, La Jolla, Calif.). Subsequently, an (EcoRI)-PstI fragment (nt 1 to 2424 SEQ ID No. 11) was isolated and ligated to a PstI-(EcoRI) fragment (nt 2424 to 3600 SEQ ID No. 11) of HBCaCHα2.11 in EcoRI-digested pIBI24 to produce DNA, HBCaCHα2, encoding a full-length human brain α₂ subunit. The 3600 bp EcoRI insert of HBCaCHα2 (nt 1 to 3600, SEQ ID No. 11) was subcloned into pcDNA1 (pHBaCHα2A) with the methionine initiating codon proximal to the CMV promoter. The 3600 bp EcoRI insert of HBCaCHα2 was also subcloned into pSV2dHFR Subramani et al. (1981). Mol. Cell. Biol. 1:854-8643 which contains the SV40 early promoter, mouse dihydrofolate reductase (dhfr) gene, SV40 polyadenylation and splice sites and sequences required for maintenance of the vector in bacteria.

EXAMPLE V DIFFERENTIAL PROCESSING OF THE HUMAN β₁ TRANSCRIPT AND THE HUMAN α₁ TRANSCRIPT

A. Differential processing of the β₁ transcript

PCR analysis of the human β₁ transcript present in skeletal muscle, aorta, hippocampus and basal ganglia, and HEK 293 cells revealed differential processing of the region corresponding to nt 615-781 of SEQ ID No. 9 in each of the tissues. Four different sequences that result in five different processed β₁ transcripts through this region were identified. The β₁ transcripts from the different tissues contained different combinations of the four sequences, except for one of the β₁ transcripts expressed in HEK 293 cells (β₁₋₅) which lacked all four sequences.

None of the β₁ transcripts contained each of the four sequences; however, for ease of reference, all four sequences are set forth end-to-end as a single long sequence in SEQ ID No. 12. The four sequences that are differentially processed are sequence 1 (nt 14-34 in SEQ ID No. 12), sequence 2 (nt 35-55 in SEQ ID No. 12), sequence 3 (nt 56-190 in SEQ ID No. 12) and sequence 4 (nt 191-271 in SEQ ID No. 12). The forms of the β₁ transcript that have been identified include: (1) a form that lacks sequence 1 called β₁₋₁ (expressed in skeletal muscle), (2) a form that lacks sequences 2 and 3 called β.sub. 1-2 (expressed in CNS), (3) a form that lacks sequences 1, 2 and 3 called β₁₋₄ (expressed in aorta and HEK cells) and (4) a form that lacks sequences 1-4 called β₁₋₅ (expressed in HEK cells). Additionally, the β₁₋₄ and β₁₋₅ forms contain the guanine nucleotide (nt 13 in SEQ ID No. 12) which is absent in the β₁₋₁ and β₁₋₂ forms. The sequences of DNA encoded the β₁ splice variants β₁₋₁, β₁₋₂, β₁₋₃, β₁₋₄ and β₁₋₅ are set forth in SEQ ID Nos. 26, 9, 10, 27 and 28, respectively

B. Differential processing of transcripts encoding the α₂ subunit.

The complete human neuronal α₂ coding sequence (nt 35-3307) plus a portion of the 5' untranslated sequence (nt 1 to 34) as well as a portion of the 3' untranslated sequence (nt 3308-3600) is set forth as SEQ ID No. 11.

PCR analysis of the human α₂ transcript present in skeletal muscle, aorta, and CNS revealed differential processing of the region corresponding to nt 1595-1942 of SEQ ID No. 11 in each of the tissues.

The analysis indicated that the primary transcript of the genomic DNA that includes the nucleotides corresponding to nt. 1595-1942 also includes an additional sequence (SEQ ID No. 13: 5'CCTATTGGTGTAGGTATACCAACAATTAATTT AAGAAAAAGGAGACCCAATATCCAG 3') inserted between nt. 1624 and 1625 of SEQ ID No. 11. Five alternatively spliced variant transcripts that differ in the presence or absence of one to three different portions of the region of the primary transcript that includes the region of nt. 1595-1942 of SEQ ID No. 11 plus SEQ ID No. 13 inserted between nt. 1624 and 1625 have been identified. The five α₂ -encocding transcripts from the different tissues include different combinations of the three sequences, except for one of the α₂ transcripts expressed in aorta which lacks all three sequences. None of the α₂ transcripts contained each of the three sequences. The sequences of the three regions that are differentially processed are sequence 1 (SEQ ID No. 13), sequence 2 (5' AACCCCAAATCTCAG 3,', which is nt. 1625-1639 of SEQ ID No. 11), and sequence 3 (5'CAAAAAAGGGCAAAATGAAGG 3', which is nt 1908-1928. of SEQ ID No. 11). The five α2 forms identified are (1) a form that lacks sequence 3 called α_(2a) (expressed in skeletal muscle), (2) a form that lacks sequence 1 called α_(2b) (expressed in CNS), (3) a form that lacks sequences 1 and 2 called α_(2c) expressed in aorta), (4) a form that lacks sequences 1, 2 and 3 called α₂ (expressed in aorta) and (5) a form that lacks sequences 1 and 3 called β_(2c) (expressed in aorta) The sequences of these splice variants, which are designated α_(2b), α_(2a), and α_(2c) -α_(2c) are set forth in SEQ. ID Nos. 11 and 22-25, respectively.

EXAMPLE VI ISOLATION OF DNA ENCODING A CALCIUM CHANNEL γ SUBUNIT FROM A HUMAN BRAIN cDNA LIBRARY

A isolation of DNA encoding the γ subunit

Approximately 1×10⁶ recombinants from a λgt11-based human hippocampus cDNA library (Clontech catalog #HL1088b, Palo Alto, Calif.) were screened by hybridization to a 484 bp sequence of the rabbit skeletal muscle calcium channel γ subunit cDNA (nucleotides 621-626 of the coding sequence plus 438 nucleotides of 3'-untranslated sequence) contained in vector γJ10 (Jay, S. et al. (1990). Science 248:490-4923. Hybridization was performed using moderate stringency conditions (20% deionized formamide, 5x Denhardt's, 6 x SSPE, 0.2% SDS, 20 μg/ml herring sperm DNA, 42° C.) and the filters were washed under low stringency (see Example I.C.). A plaque that hybridized to this probe was purified and insert DNA was subcloned into pGEM7Z. This cDNA insert was designated γ1.4.

B. Characterization of γ1.4

γ1.4 was confirmed by DNA hybridization and characterized by DNA sequencing. The 1500 bp SstI fragment of γ1.4 hybridized to the rabbit skeletal muscle calcium channel γ subunit cDNA γJ10 on a Southern blot. SEQ analysis of this fragment revealed that it contains of approximately 500 nt of human DNA sequence and ˜1000 nt of λgt11 sequence (included due to apparent destruction of one of the EcoRI cloning sites in λgt11). The human DNA sequence contains of 129 nt of coding sequence followed immediately by a translational STOP codon and 3' untranslated sequence (SEQ ID No. 14).

To isolate the remaining 5' sequence of the human γ subunit cDNA, human CNS cDNA libraries and/or preparations of mRNA from human CNS tissues can first be assayed by RCR methods using oligonucleotide primers based on the γ cDNA-specific sequence of γ1.4. Additional human neuronal γ subunit-encoding DNA can isolated from cDNA libraries that, based on the results of the PCR assay, contain γ-specific amplifiable cDNA. Alternatively, cDNA libraries can be constructed from mRNA preparations that, based on the results of pCR assays, contain γ-specific amplifiable transcripts. Such libraries are constructed by standard methods using oligo dT to prime first-strand cDNA synthesis from poly A+RNA (see Example I.B.). Alternatively, first-strand cDNA can be specified by priming first-strand cDNA synthesis with a γ cDNA-specific oligonucleotide based on the human DNA sequence in γ1.4. A cDNA library can then be constructed based on this first-strand synthesis and screened with the γ-specific portion of γ1.4.

EXAMPLE VII RECOMBINANT EXPRESSION OF HUMAN NEURONAL CALCIUM CHANNEL SUBUNIT-ENCODING cDNA AND RNA TRANSCRIPTS IN MAMMALIAN CELLS

A. Recombinant Expression of the Human Neuronal Calcium Channel α₂ subunit cDNA in DG44 Cells

1. Stable transfection of DG44cells

DG44 cells dhfr⁻ Chinese hamster ovary cells; see, e.g., Urlaub, G. et al. (1986) Som. Cell Molec. Genet. 12:555-566! obtained from Lawrence Chasin at Columbia University were stably transfected by CaPO₄ precipitation methods Wigler et al. (1979) Proc. Natl. Acad. Sci. USA 76:1373-13763 with pSV2dhfr vector containing the human neuronal calcium channel α₂ -subunit cDNA (see Example IV) for polycistronic expression/selection in transfected cells. Transfectants were grown on 10% DMEM medium without hypoxanthine or thymidine in order to select cells that had incorporated the expression vector. Twelve transfectant cell lines were established as indicated by their ability to survive on this medium.

2. Analysis of α₂ subunit cDNA expression in transfected DG44 cells

Total RNA was extracted according to the method of Birnboim (1988) Nuc. Acids Res. 16:1487-1497! from four of the DG44 cell lines that had been stably transtected with pSV2dhfr containing the human neuronal calcium channel α₂ subunit cDNA. RNA (˜15 μg per lane) was separated on a 1% agarose formaldehyde gel, transferred to nitrocellulose and hybridized to the random-primed human neuronal calcium channel α₂ cDNA (hybridization: 50% formamide, 5 x PESPE, 5 x Denhardt's, 42° C.; wash :0.2 x SSPE, 0.1% SDS, 65° C.). Northern blot analysis of total RNA from four of the DG44 cell lines that had been stably transfected with pSV2dhfr containing the human neuronal calcium channel α₂ subunit cDNA revealed that one of the four cell lines contained hybridizing mRNA the size expected for the transcript of the α₂ subunit cDNA (5000 nt based on the size of the cDNA) when grown in the presence of 10 mM sodium butyrate for two days. Butyrate nonspecifically induces transcription and is often used for inducing the SV40 early promoter Gorman, C. and Howard, B. (1983) Nucleic Acids Res. 11:1631!. This cell line, 44α₂₋₉, also produced mRNA species smaller (several species) and larger (6800 nt) than the size expected for the transcript of the α₂ cDNA (5000 nt) that hybridized to the α₂ cDNA-based probe. The 5000- and 6800-nt transcripts produced by this transfectant should contain the entire 2 subunit coding sequence and therefore should yield a full-length α₂ subunit protein. A weakly hybridizing 8000-nucleotide transcript was present in untransfected and transfected DG44 cells. Apparently, DG44 cells transcribe a calcium channel α₂ subunit or similar gene at low levels. The level of expression of this endogenous α₂ subunit transcript did not appear to be affected by exposing the cells to butyrate before isolation of RNA for northern analysis.

Total protein was extracted from three of the DG44 cell lines that had been stably transfected with pSV2dhfr containing the human neuronal calcium channel α₂ subunit cDNA. Approximately 10⁷ cells were sonicated in 300 μl of a solution containing 50 mM HEPES, 1 mM EDTA, 1 mM PMSP. An equal volume of 2x loading dye Laemmli, U.K. (1970). Nature 227:690! was added to the samples and the protein was subjected to electrophoresis on an 8% polyacrylamide gel and then electrotransferred to nitrocellulose. The nitrocellulose was incubated with polyclonal guinea pig antisera (1:200 dilution) directed against the rabbit skeletal muscle calcium channel α₂ subunit (obtained from K. Campbell, University of Iowa) followed by incubation with ¹²³ I!-protein A. The blot was exposed to X-ray film at -70° C. Reduced samples of protein from the transfected cells as well as from untransfected DG44 cells contained immunoreactive protein of the size expected for the α₂ subunit of the human neuronal calcium channel (130-150 kDa). The level of this immunoreactive protein was higher in 44α₂ -9 cells that had been grown in the presence of 10 mM sodium butyrate than in 44α₂ -9 cells that were grown in the absence of sodium butyrate. These data correlate well with those obtained in northern analyses of total RNA from 44α₂ -9 and untransfected DG44 cells. Cell line 44α₂ -9 also produced a 110 kD immunoreactive protein that may be either a product of proteolytic degradation of the full-length α₂ subunit or a product of translation of one of the shorter (<5000 nt) mRNAs produced in this cell line that hybridized to the α₂ subunit cDNA probe.

B. Expression of DNA encoding human neuronal calcium channel α₁, α₂ and β₁ subunits in NK cells

Human embryonic kidney cells (HEK 293 cells) were transiently and stably transfected with human neuronal DNA encoding calcium channel subunits. Individual transfectants were analyzed electrophysiologically for the presence of voltage-activated, barium, currents and functional recombinant voltage-dependent calcium channels were.

1. Transfection of REX 293 cells

Separate expression vectors containing DNA encoding human neuronal calcium channel α_(1D), α₂ and β₁ subunits, plasmids pVDCCIII(A), pHBCaCHα-A, and pHBCaCHβ_(1a) RBS(A), respectively, were constructed as described in Examples II.A.3, IV.B. and III.B.3., respectively. These three vectors were used to transiently co-transfect HEK 293 cells. For stable transfection of HEK 293 cells, vector pHBCaCHβ_(1b) RBS(A) (Example III.B.3.) was used in place of pHBCaCHβ_(1b) RBS(A) to introduce the DNA encoding the β₁ subunit into the cells along with pVDCCIII(A) and pHBCaCHα₂ A.

a. Transient transfection

Expression vectors pVDCCIII(A), pHBCaCHα₂ A and pHBCaCHβ_(1a) RBS(A) were used in two sets of transient transfections of HEK 293 cells (ATCC Accession No. CRL1573). In one transfection procedure, HEK 293 cells were transiently cotransfected with the α₁ subunit cDNA expression plasmid, the α₂ subunit cDNA expression plasmid, the β₁ subunit cDNA expression plasmid and plasmid pCMVβgal (Clontech Laboratories, Palo Alto, Calif.). Plasmid pCMVβgal contains the lacZ gene (encoding E. coli β-galactosidase) fused to the cytomegalovirus (CMV) promoter and was included in this transfection as a marker gene for monitoring the efficiency of transfection. In the other transfection procedure, HEK 293 cells were transiently co-transfected with the α₁ subunit cDNA expression plasmid pVDCCIII(A) and pCMVβgal. In both transfections, 2-4×10⁶ HEK 293 cells in a 10-cm tissue culture plate were transiently co-transfected with 5 μg of each of the plasmids included in the experiment according to standard CaPO₄ precipitation transfection procedures (Wigler et al. (1979) Proc. Natl. Acad. Sci. USA 76:1373-1376). The transfectants were analyzed for β-galactosidase expression by direct staining of the product of a reaction involving β-galactosidase and the X-gal substrate Jones, J. R. (1986) EMBO 5:3133-3142! and by measurement of β-galactosidase activity Miller, J. H. (1972) Experiments in Molecular Genetics, pp. 352-355, Cold Spring Harbor Press!. To evaluate subunit cDNA expression in these transfectants,. the cells were analyzed for subunit transcript production (northern analysis), subunit protein production (immunoblot analysis of cell lysates) and functional calcium channel expression (electrophysiological analysis).

b. Stable transfection

HEK 293 cells were transfected using the calcium phosphate transfection procedure (Current Protocols In Molecular Biology, Vol. 1, Wiley Inter-Science, Supplement 14, Unit 9.1.1-9.1.9 (1990)!. Ten-cm plates, each containing ore-to-two million HEK 293 cells, were transfected with 1 ml of DNA/calcium phosphate precipitate containing 5 μg pVDCCIII(A), 5 μg pHBCaCHα2A, 5 μg pHBCaCHβ_(1b) RBS(A), 5 μg pCMVBgal and 1 μg pSV2neo (as a selectable marker). After 10-20 days of growth in media containing 500 μg G418, colonies had formed and were isolated using cloning cylinders.

2. Analysis of HEX 293 cells transiently transfected with DNA encoding human neuronal calcium channel subunits

a. Analysis of β-galactosidase expression Transient transfectants were assayed for β-galactosidase expression by β-galactosidase activity assays (Miller, J. H., (1972) Experiments in Molecular Genetics, pp. 352-355, Cold Spring Harbor Press) of cell lysates (prepared as described in Example VII.A.2) and staining of fixed cells (Jones, J. R. (1986) EMBO 5:3133-3142). The results of these assays indicated that approximately 30% of the HEK 293 cells had been transfected.

b. Northern analysis

PolyA+ RNA was isolated using the Invitrogen Fast Trak Kit (InVitrogen, San Diego, Calif.) from HEK 293 cells transiently transfected with DNA encoding each of the α₁, α₂ and β₁ subunits and the lacZ gene or the α₁ subunit and the lacz gene. The RNA was subjected to electrophoresis on an agarose gel and transferred to nitrocellulose. The nitrocellulose was then hybridized with one or more of the following radiolabeled probes: the lacZ gene, human neuronal calcium channel α_(1D) subunit encoding cDNA, human neuronal calcium channel α₂ subunit-encoding cDNA or human neuronal calcium channel β₁ subunit-encoding cDNA. Two transcripts that hybridized with the α₁ subunit-encoding cDNA were detected in HEK 293 cells transfected with the DNA encoding the α₁, α₂, and β₁ subunits and the lacz gene as well as in HEK 293 cells transfected with the α₁ subunit cDNA and the lacZ gene. One mRNA species was the size expected for the transcript of the α₁ subunit cDNA (8000 nucleotides). The second RNA species was smaller (4000 nucleotides) than the size expected for this stranscript. RNA of the size expected for the transcript of the lacZ gene was detected in cells transfected with the α₁, α₂ and β1, subunit-encoding cDNA and the lacZ gene and in cells transfected with the α₁ subunit cDNA and the lacz gene by hybridization to the lacz gene sequence.

RNA from cells transfected with the α₁, α₂ and β₁ subunit-encoding cDNA and the lacz gene was also hybridized with the α₂ and β₁ subunit cDNA probes. Two mRNA species hybridized to the α₂ subunit cDNA probe. One species was the size expected for the transcript of the α₂ subunit cDNA (4000 nucleotides). The other species was larger (6000 nucleotides) than the expected size of this transcript. Multiple RNA species in the cells co-transfected with α₁, α₂ and β₁ subunit-encoding cDNA and the lacZ gene hybridized to the β₁ subunit cDNA probe. Multiple β-subunit transcripts of varying sizes were produced since the β subunit cDNA expression vector contains two potential polyA⁺ addition sites.

c. Electrophysiological analysis

Individual transiently transfected HEK 293 cells were assayed for the presence of voltage-dependent barium currents using the whole-cell variant of the patch clamp technique (Hamill et al. (1981). Pflugers Arch. 391:85-100!. HEK 293 cells transiently transfected with pCMVβgal only were assayed for barium currents as a negative control in these experiments. The cells were placed in a bathing solution that contained barium ions to serve as the current carrier. Choline chloride, instead of NaCl or KCl, was used as the major salt component of the bath solution to eliminate currents through sodium and potassium channels. The bathing solution contained 1 mM MgCl₂ and was buffered at pH 7.3 with 10 mM HEPES (pH adjusted with sodium or tetraethylammonium hydroxide). Patch pipettes were filled with a solution containing 135 mM CsCl, 1 mM MgCl₂, 10 mM glucose, 10 mM EGTA, 4 mM ATP and 10 mM HEPES (pH adjusted to 7.3 with tetraethylammonium hydroxide). Cesium and tetraethylammonium ions block most types of potassium channels. Pipettes were coated with Sylgard (Dow-Corning, Midland, Mich.) and had resistances of 1-4 megohm. Currents were measured through a 500 megohm headstage resistor with the Axopatch IC (Axon Instruments, Foster City, Calif.) amplifier, interfaced with a Labmaster (Scientific Solutions, Solon, Ohio) data acquisition board in an IBM-compatible PC. PClamp (Axon Instruments) was used to generate voltage commands and acquire data. Data, were analyzed with pclamp or Quattro Professional (Borland International, Scotts Valley, Calif.) programs.

To apply drugs, "puffer" pipettes positioned within several micrometers of the cell under study were used to apply solutions by pressure application. The drugs used for pharmacological characterization were dissolved in a solution identical to the bathing solution. Samples of a 10 mM stock solution of Bay K 8644 (RBI, Natick, Mass.), which was prepared in DMSO, were diluted to a final concentration of 1 μM in 15 mM Ba²⁺ -containing bath, solution before they were applied.

Twenty-one negative control HEK 293 cells (transiently transfected with the lacZ gene expression vector pCMVβgal only) were analyzed by the whole-cell variant of the patch clamp method for recording currents. Only one cell displayed a discernable inward barium current; this current was not affected by the presence of 1 μM Bay K 8644. In addition, application of Bay K 8644 to four cells that did not display Ba²⁺ currents did not result in the appearance of any currents.

Two days after transient transfection of HEK 293 cells with α₁, α₂ and β₁ subunit-encoding cDNA and the lacZ gene, individual transfectants were assayed for voltage dependent barium currents. The currents in nine transfectants were recorded. Because the efficiency of transfection of one cell can vary from the efficiency of transfection of another cell, the degree of expression of heterologous proteins in individual transfectants varies and some cells do not incorporate or express the foreign DNA. Inward barium currents were detected in two of these nine transfectants. In these assays, the holding potential of the membrane was -90 mV. The membrane was depolarized in a series of voltage steps to different test potentials and the current in the presence and absence of 1 μM Bay K 8644 was recorded. The inward barium current was significantly enhanced in magnitude by the addition of Bay K 8644. The largest inward barium current (˜160 pA) was recorded when the membrane was depolarized to 0 mV in the presence of 1 μM Bay K 8644. A comparison of the I-V curves, generated by plotting the largest current recorded after each depolarization versus the depolarization voltage, corresponding to recordings conducted in the absence and presence of Bay K 8644 illustrated the enhancement of the voltage-activated current in the presence of Bay K 8644.

Pronounced tail currents were detected in the tracings of currents generated in the presence of Bay K 8644 in HEK 293 cells transfected with α₁, α₂ and β₁ subunit-encoding cDNA and the lacZ gene, indicating that the recombinant calcium channels responsible for the voltage-activated barium currents recorded in this transfected appear to be DHP-sensitive.

The second of the two transfected cells that displayed inward barium currents expressed a -50 pA current when the membrane was depolarized from -9.0 mV. This current was nearly completely blocked by 200 μM cadmium, an established calcium channel blocker.

Ten cells that were transiently transfected with the DNA encoding the α₁ subunit and the lacz gene were analyzed by whole-cell patch clamp methods two days after transfection. One of these cells displayed a 30 pA inward barium current. This current amplified 2-fold in the presence of 1 μM Bay K 8644. Furthermore, small tail currents were detected in the presence of Bay K 8644. These data indicate that expression of the human neuronal calcium channel α_(1D) subunit-encoding cDNA in HEK 293 yields a functional DHP-sensitive calcium channel.

3. Analysis of REK 293 cells stably transfected with DNA encoding human neuronal calcium channel subunits

Individual stably transfected HEK 293 cells were assayed electrophysiologically for the presence of voltage-dependent barium currents as described for electrophysiological analysis of transiently transfected HEK 293 cells (see Example VII.B.2.c). In an effort to maximize calcium channel activity via cyclic-AMP-dependent kinase-mediated phosphorylation Pelzer, et al. (1990) Rev. Physiol. Biochem. Pharmacol. 114:107-207!, cAMP (Na salt, 250 μM) was added to the pipet solution and forskolin (10 μM) was added to the bath solution in some of the recordings. Qualitatively similar results were obtained whether these compounds were present or not.

Barium currents were recorded from stably transfected cells in the absence and presence of Bay K 8644 (1 μM). When the cell was depolarized to -10 mV from a holding potential of -90 mV in the absence of Bay K 8644, a current of approximately 35 pA with a rapidly deactivating tail current was recorded. During application of Bay K 8644, an identical depolarizing protocol elicited a current of approximately 75 pA, accompanied by an augmented and prolonged tail current. The peak magnitude of currents recorded from this same cell as a function of a series of depolarizing voltages were assessed. The responses in the presence of Bay K 8644 not only increased, but the entire current-voltage relation shifted about -10 mV. Thus, three typical hallmarks of Bay K 8644 action, namely increased current magnitude, prolonged tail currents, are negatively shifted activation voltage, were observed, clearly indicating the expression of a DHP-sensitive calcium channel in these stably transfected cells. No such effects of Bay K 8644 were observed in untransfected HEK 293 cells, either with or without cAMP or forskolin.

C. Use of pCMV-based vectors and pcDNA1-based vectors for expression of DNA encoding human neuronal calcium channel subunits

1. Preparation of constructs

To determine if the levels of recombinant expression of human calcium channel subunit-encoding DNA in host cells could be enhanced by using pCMV-based instead of pcDNA1-based expression vectors, additional expression vectors were constructed. The full-length α_(1D) cDNA from pVDCCIII(A) (see Example II.A.3.d), the full-length α₂ cDNA, contained on a 3600 bp EcoRI fragment from HBCaCHα₂ (see Example IV.B) and a full-length β₁ subunit cDNA from pHDCaCHβ_(1b) RBS(A) (see Example III.B.3) were separately subcloned into plasmid pCMVβgal. Plasmid pCMVβgal was digested with NotI to remove the lacz gene. The remaining vector portion of the plasmid, referred to as pCMV, was blunt-ended at the NotI sites. The full-length α₂ -encoding DNA and β₁ -encoding DNA, contained on separate EcoRI fragments, were isolated, blunt-ended and separately ligated to the blunt-ended vector fragment of pCMV locating the cDNAs between the CMV promoter and SV40 polyadenylation sites in pCMV. To ligate the α_(1D) -encoding cDNA with pCMV, the restriction sites in the polylinkers immediately 5' of the CMV promoter and immediately 3' of the SV40 polyadenylation site were removed from pCMV. A polylinker was added at the NotI site. The polylinker had the following sequence of restriction enzyme recognition sites: ##STR1## The α_(1D) -encoding DNA, isolated as a BamHI/XhoI fragment from pVDCCIII(A), was then ligated to XbaII/SalI-digested pCMV to place it between the CMV promoter and SV40 polyadanylation site.

Plasmid pCMV contains the CMV promoter as does pcDNA1, but differs from pcDNA1 in the location of splice donor/splice acceptor sites relative to the inserted subunit-encoding DNA. After inserting the subunit-encoding DNA into pCMV, the splice donor/splice acceptor sites are located 3' of the CMV promoter and 5' of the subunit-encoding DNA start codon. After inserting the subunit-encoding DNA into pcDNA1, the splice donor/splice acceptor sites are located 3' of the subunit cDNA stop codon.

2. Transfection of HEK 293 cells

HEK 293 cells were transiently co-transfected with the α_(1D), α₂ and β₁ subunit-encoding DNA in pCMV or with the α_(1D), α₂ and β subunit-encoding DNA in pcDNA1 (vectors pVDCCIII(A), pHBCaCHα₂ A and pHBCaCHβ_(1b) RBS (A), respectively), as described in Example VII.B.l.a. Plasmid pCMVβgal was included in each transfection to as a measure of transfection efficiency. The results of β-galactosidase assays of the transfectants (see Example VII.B.2.), indicated that HEX 293 cells were transfected equally efficiently with pCMV- and pcDNA1-based plasmids.

3. Northern analysis

Total and polyA⁺ RNA were isolated from the transiently transfected cells as described in Examples VII.A.2 and VII.B.2.b. Northern blots of the RNA were hybridized with the following radiolabeled probes: α_(1D) cDNA, human neuronal calcium channel α₂ subunit cDNA and DNA encoding the human neuronal calcium channel β₁ subunit. Messenger RNA of sizes expected for α_(1D), α₂ and β₁ subunit transcripts were detected in all transfectants. A greater amount of the α_(1D) transcript was present in cells that were co-transfected with pCMV-based plasmids then in cells that were co-transfected with pcDNA1-based plasmids. Equivalent amounts of α₂ and β₁ subunit transcripts were detected in all transfectants.

D. Expression in Xenopus laevis oocytes of RNA encoding human neuronal calcium channel subunits

Various combinations of the transcripts of DNA encoding the human neuronal α_(1D), α₂ and β₁ subunits prepared in vitro were injected into Xenopus laevis oocytes. Those injected with combinations that included a D exhibited voltage-activated barium currents.

1. Preparation of transcripts

Transcripts encoding the human neuronal calcium channel α_(1D), α₂ and β₁ subunits were synthesized according to the instructions of the mCAP mRNA CAPPING KIT (Strategene, La Jolla, Calif. catalog #200350). Plasmids PVDCC III.RBS(A), containing of pcDNA1 and the α_(1D) cDNA that begins with a ribosome binding site and the eighth ATG codon of the coding sequence (see Example III.A.3.d), plasmid pHBCaCHα₁ A containing of pcDNA1 and an α2 subunit cDNA (see Example IV), and plasmid pHBCaCHPα₁ RBS(A) containing pcDNA1 and the β₁ DNA lacking intron sequence and containing a ribosome binding site (see Example III), were linearized by restriction digestion. The α_(1D) cDNA- and a₂ subunit-encoding plasmids were digested with XhoI, and the β₁ subunit-encoding plasmid was digested with EcoRV. The DNA insert was transcribed with T7 RNA polymerase.

2. Injection of oocytes

Xenopus laevis oocytes were isolated and defolliculated by collagenase treatment and maintained in 100 mM NaCl, 2 mM KCl, 1.8 mM CaCl₂, 1 mM MgCl₂, 5 mM HEPES, pH 7.6, 20 μg/ml ampicillin and 25 μg/ml streptomycin at 19°-25° C. for 2 to 5 days after, injection and prior to recording. For each transcript that was injected into the oocyte, 6 ng of the specific mRNA was injected per cell in a total volume of 50 nl.

3. Intracellular voltage recordings

Injected oocytes were examined for voltage-dependent barium currents using two-electrode voltage clamp methods Dascal, N. (1987) CRC Crit. Rev. Biochem. 22:317!. The pclamp (Axon Instruments) software package was used in conjunction with a Labmaster 125 kHz data acquisition interface to generate voltage commands and to acquire and analyze data. Quattro Professional was also used in this analysis. Current signals were digitized at 1-5 kHz, and filtered appropriately. The bath solution contained of the following: 40 mM BaCl₂, 36 mM tetraethylammonium chloride (TEA-Cl), 2 mr KCl, 5 mM 4-aminopyridine, 0.15 mM niflumic acid, 5 mM HEPES, pH 7.6.

a. Electrophysiological analysis of oocytes injected with transcripts encoding the human neuronal calcium channel α₁, α₂ and β₁ -subunits

Uninjected oocytes were examined by two-electrode voltage clamp methods and a very small (25 nA) endogenous inward Ba²⁺ current was detected in only one of seven analyzed cells.

Oocytes coinjected with α_(1D), α₂ and β₁ subunit transcripts expressed sustained inward barium currents upon depolarization of the membrane from a holding potential of -90 mV or -50 mV (154±129 nA, n=21). These currents typically showed little inactivation when test pulses ranging from 140 to 700 msec. were administered. Depolarization to a series of voltages revealed currents that first appeared at approximately -30 mV and peaked at approximately 0 mV.

Application of the DHP Bay K 8644 increased the magnitude of the currents, prolonged the tail currents present upon repolarization of the cell and induced a hyperpolarizing shift in current activation. Bay K 8644 was prepared fresh from a stock solution in DMSO and introduced as a lox concentrate directly into the CO μl bath while the perfusion pump was turned off. The DMSO concentration of the final diluted drug solutions in contact with the cell never exceeded 0.1%. Control experiments showed that 0.1% DMSO had no effect on membrane currents.

Application of the DHP antagonist nifedipine (stock solution prepared in DMSO and applied to the cell as described for application of Bay K 8644) blocked a substantial fraction (91±6%, n=7) of the inward barium current in oocytes coinjected with transcripts of the α_(1D), β₂ and β₁ subunits. A residual inactivating component of the inward barium current typically remained- after nifedipine application. The inward barium current was blocked completely by 50 μM Cd²⁺, but only approximately 15% by 100 μM Ni²⁺.

The effect of ωCgTX on the inward barium currents in oocytes co-injected with transcripts of the α_(1D), α₂, and β₁ subunits was investigated. ωCgTX (Bachem, Inc., Torrance Calif.) was prepared in the 15 mM BaCd₂ bath solution plus 0.1% cytochrome C (sigma) to serve as a carrier protein.

Control experiments showed that cytochrome C had no effect on currents. A series of voltage pulses from a. -90 mV holding potential to 0 mV were recorded at 20 msec. intervals. To reduce the inhibition of ωCgTX binding by divalent cations, recordings were made in 15 mM BaCl₂, 73.5 mM tetraethylammonium chloride, and the remaining ingredients identical to the 40 mM Ba²⁺ recording solution. Bay K 8644 was applied to the cell prior to addition to ωCgTX in order to determine the effect of ωCGTX on the DHP-sensitive current component that was distinguished by the prolonged tail currents. The inward, barium current was blocked weakly (54±29%, n=7) and reversibly by relatively high concentrations (10-15 μM) of ωCgTX. The test currents and the accompanying tail currents were blocked progressively within two to three minutes after application of ωCgTX, but both recovered partially as the ωcgTX was flushed from the bath.

b. Analysis of oocytes injected with only a transcripts encoding the human neuronal calcium channel α_(1D) or transcripts encoding an α_(1D) and other subunits

The contribution of the α₂ and β₁ subunits to the inward barium current in oocytes injected with transcripts encoding the α_(1D), β₂ and β₁ subunits was assessed by expression of the α_(1D) subunit alone or in combination with either the β₁ subunit or the α₂ subunit. In oocytes injected with only the transcript of a α_(1D) cDNA, no Ba²⁺ currents were detected (n-3). In oocytes injected with transcripts of α_(1D) and β₁ cDNAs, small (108±39 nA) Ba²⁺ currents were detected upon depolarization of the membrane from a holding potential of -90 mV that resembled the currents observed in cells injected with transcripts of α_(1D), α₂ and β₁ cDNAs, although the magnitude of the current was less. In two of the four oocytes injected with transcripts of the α_(1D) -encoding and β₁ -encoding DNA, the Ba²⁺ currents exhibited a sensitivity to Bay X 8644 that was similar to the Bay K 8644 sensitivity of Ba²⁺ currents expressed in oocytes injected with transcripts encoding the α_(1D) α₁ -, α₂, and β₁ subunits.

Three of five o8cytes injected with transcripts encoding the α_(1D) and α₂ subunits exhibited very small Ba²⁺ currents (15-30 nA) upon depolarization of the membrane from a holding potential of -90 mV. These barium currents showed little or no response to Bay K 8644.

C. Analysis of oocytes injected with transcripts encoding the human neuronal calcium channel α₂ and/or β₁ subunit

To evaluate the contribution of the α_(1D) α₁ -subunit to the inward barium currents detected in oocytes co-injected with transcripts encoding the α_(1D), α₂ and β₁ subunits, oocytes injected with transcripts encoding the human neuronal calcium channel α₂ and/or β₁ subunits were assayed for barium currents. Oocytes injected with transcripts encoding the α₂ subunit displayed no detectable inward barium currents (n-5). Oocytes injected with transcripts encoding a β₁ subunit displayed measurable (54±23 nA, n=S) inward barium currents upon depolarization and oocytes injected with transcripts encoding the α₂ and β₁ subunits displayed inward barium currents that were approximately 50% larger (80±61 nA, n=18) than those detected in oocytes injected with transcripts of the β₁ -encoding DNA only.

The inward barium currents in oocytes injected with transcripts encoding the β₁ subunit or α₂ and β₁ subunits typically were first observed when the, membrane was depolarized to -30 mV from a holding potential of -90 mV and peaked when the membrane was depolarized to 10 to 20 mV. Macroscopically, the currents in oocytes injected with transcripts encoding the α₂ and β₁ subunits or with transcripts encoding the β₁ subunit were indistinguishable. In contrast to the currents in oocytes co-injected with transcripts of α_(1D), α₂ and β₁ subunit cDNAs, these currents showed a significant inactivation during the test pulse and a strong sensitivity to the holding potential. The inward barium currents in oocytes co-injected with transcripts encoding the α₂ and β₁ subunits usually inactivated to 10-60% of the peak magnitude during a 140-msec pulse and were significantly more sensitive to holding potential than those in oocytes co-injected with transcripts encoding the α_(1D), α₂ and β1 subunits. Changing the holding potential of the membranes of oocytes co-injected with transcripts encoding the α₂ and β₁ subunits from -90 to -50 mV resulted in an approximately 81% (n=11) reduction in the magnitude of the inward barium current of these cells. In contrast, the inward barium current measured in oocytes co-injected with transcripts encoding the α_(1D), α₂ and β₁ subunits were reduced approximately 24% (n-11) when the holding potential was changed from -90 to -50 mV.

The inward barium currents detected in oocytes injected with transcripts encoding the α₂ and β₁ subunits were pharmacologically distinct from those observed in oocytes co-injected with transcripts encoding the α_(1D), α₂ and β₁ subunits. Oocytes injected with transcripts encoding the α₂ and β₁ subunits displayed inward barium currents that were insensitive to Bay K 8644 (n=11). Nifedipine sensitivity was difficult to measure because of the holding potential sensitivity of nifedipine and the current observed in oocytes injected with transcripts encoding the β₂ and β₁ subunits. Nevertheless, two oocytes that were co-injected with transcripts encoding the α₂ and β₁ subunits displayed measurable (25 to 45 nA) inward barium currents when depolarized from a holding potential of -50 mV. These currents were insensitive to nifedipine (5 to 10 μM). The inward barium currents in oocytes injected with transcripts encoding the α₂ and β₁ subunits showed the same sensitivity to heavy metals as the currents detected in oocytes injected with transcripts encoding the α_(1D), α₂ and β₁ subunits.

The inward barium current detected in oocytes injected with transcripts encoding the human neuronal α₂ and β₁ subunits has pharmacological and biophysical properties that resemble calcium currents in uninjected Xenopus oocytes. Because the amino acids of this human neuronal calcium channel β₁ subunit lack hydrophobic segments capable of forming transmembrane domains, it is unlikely that recombinant β₁ subunits alone can form an ion channel. It is more probable that a homologous endogenous α₁ subunit exists in oocytes and that the activity mediated by such an α₁ subunit is enhanced by expression of a human neuronal β₁ subunit.

While the invention has been described with some specificity, modifications apparent to those with ordinary skill in the art may be made without departing from the scope of the invention. Since such modifications will be apparent to those of skill in the art, it is intended that this invention be limited only by the scope of the appended claims.

    __________________________________________________________________________     SEQUENCE LISTING                                                               (1) GENERAL INFORMATION:                                                       (iii) NUMBER OF SEQUENCES: 29                                                  (2) INFORMATION FOR SEQ ID NO:1:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 7635 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 511..6996                                                        (ix) FEATURE:                                                                  (A) NAME/KEY: 5'UTR                                                            (B) LOCATION: 1..510                                                           (ix) FEATURE:                                                                  (A) NAME/KEY: 3'UTR                                                            (B) LOCATION: 6994..7635                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                                        GGGCGAGCGCCTCCGTCCCCGGATGTGAGCTCCGGCTGCCCGCGGTCCCGAGCCAGCGGC60                 GCGCGGGCGGCGGCGGCGGGCACCGGGCACCGCGGCGGGCGGGCAGACGGGCGGGCATGG120                GGGGAGCGCCGAGCGGCCCCGGCGGCCGGGCCGGCATCACCGCGGCGTCTCTCCGCTAGA180                GGAGGGGACAAGCCAGTTCTCCTTTGCAGCAAAAAATTACATGTATATATTATTAAGATA240                ATATATACATTGGATTTTATTTTTTTAAAAAGTTTATTTTGCTCCATTTTTGAAAAAGAG300                AGAGCTTGGGTGGCGAGCGGTTTTTTTTTAAAATCAATTATCCTTATTTTCTGTTATTTG360                TCCCCGTCCCTCCCCACCCCCCTGCTGAAGCGAGAATAAGGGCAGGGACCGCGGCTCCTA420                CCTCTTGGTGATCCCCTTCCCCATTCCGCCCCCGCCCCAACGCCCAGCACAGTGCCCTGC480                ACACAGTAGTCGCTCAATAAATGTTCGTGGATGATGATGATGATGATGATGAAA534                      MetMetMetMetMetMetMetLys                                                       15                                                                             AAAATGCAGCATCAACGGCAGCAGCAAGCGGACCACGCGAACGAGGCA582                            LysMetGlnHisGlnArgGlnGlnGlnAlaAspHisAlaAsnGluAla                               101520                                                                         AACTATGCAAGAGGCACCAGACTTCCTCTTTCTGGTGAAGGACCAACT630                            AsnTyrAlaArgGlyThrArgLeuProLeuSerGlyGluGlyProThr                               25303540                                                                       TCTCAGCCGAATAGCTCCAAGCAAACTGTCCTGTCTTGGCAAGCTGCA678                            SerGlnProAsnSerSerLysGlnThrValLeuSerTrpGlnAlaAla                               455055                                                                         ATCGATGCTGCTAGACAGGCCAAGGCTGCCCAAACTATGAGCACCTCT726                            IleAspAlaAlaArgGlnAlaLysAlaAlaGlnThrMetSerThrSer                               606570                                                                         GCACCCCCACCTGTAGGATCTCTCTCCCAAAGAAAACGTCAGCAATAC774                            AlaProProProValGlySerLeuSerGlnArgLysArgGlnGlnTyr                               758085                                                                         GCCAAGAGCAAAAAACAGGGTAACTCGTCCAACAGCCGACCTGCCCGC822                            AlaLysSerLysLysGlnGlyAsnSerSerAsnSerArgProAlaArg                               9095100                                                                        GCCCTTTTCTGTTTATCACTCAATAACCCCATCCGAAGAGCCTGCATT870                            AlaLeuPheCysLeuSerLeuAsnAsnProIleArgArgAlaCysIle                               105110115120                                                                   AGTATAGTGGAATGGAAACCATTTGACATATTTATATTATTGGCTATT918                            SerIleValGluTrpLysProPheAspIlePheIleLeuLeuAlaIle                               125130135                                                                      TTTGCCAATTGTGTGGCCTTAGCTATTTACATCCCATTCCCTGAAGAT966                            PheAlaAsnCysValAlaLeuAlaIleTyrIleProPheProGluAsp                               140145150                                                                      GATTCTAATTCAACAAATCATAACTTGGAAAAAGTAGAATATGCCTTC1014                           AspSerAsnSerThrAsnHisAsnLeuGluLysValGluTyrAlaPhe                               155160165                                                                      CTGATTATTTTTACAGTCGAGACATTTTTGAAGATTATAGCGTATGGA1062                           LeuIleIlePheThrValGluThrPheLeuLysIleIleAlaTyrGly                               170175180                                                                      TTATTGCTACATCCTAATGCTTATGTTAGGAATGGATGGAATTTACTG1110                           LeuLeuLeuHisProAsnAlaTyrValArgAsnGlyTrpAsnLeuLeu                               185190195200                                                                   GATTTTGTTATAGTAATAGTAGGATTGTTTAGTGTAATTTTGGAACAA1158                           AspPheValIleValIleValGlyLeuPheSerValIleLeuGluGln                               205210215                                                                      TTAACCAAAGAAACAGAAGGCGGGAACCACTCAAGCGGCAAATCTGGA1206                           LeuThrLysGluThrGluGlyGlyAsnHisSerSerGlyLysSerGly                               220225230                                                                      GGCTTTGATGTCAAAGCCCTCCGTGCCTTTCGAGTGTTGCGACCACTT1254                           GlyPheAspValLysAlaLeuArgAlaPheArgValLeuArgProLeu                               235240245                                                                      CGACTAGTGTCAGGAGTGCCCAGTTTACAAGTTGTCCTGAACTCCATT1302                           ArgLeuValSerGlyValProSerLeuGlnValValLeuAsnSerIle                               250255260                                                                      ATAAAAGCCATGGTTCCCCTCCTTCACATAGCCCTTTTGGTATTATTT1350                           IleLysAlaMetValProLeuLeuHisIleAlaLeuLeuValLeuPhe                               265270275280                                                                   GTAATCATAATCTATGCTATTATAGGATTGGAACTTTTTATTGGAAAA1398                           ValIleIleIleTyrAlaIleIleGlyLeuGluLeuPheIleGlyLys                               285290295                                                                      ATGCACAAAACATGTTTTTTTGCTGACTCAGATATCGTAGCTGAAGAG1446                           MetHisLysThrCysPhePheAlaAspSerAspIleValAlaGluGlu                               300305310                                                                      GACCCAGCTCCATGTGCGTTCTCAGGGAATGGACGCCAGTGTACTGCC1494                           AspProAlaProCysAlaPheSerGlyAsnGlyArgGlnCysThrAla                               315320325                                                                      AATGGCACGGAATGTAGGAGTGGCTGGGTTGGCCCGAACGGAGGCATC1542                           AsnGlyThrGluCysArgSerGlyTrpValGlyProAsnGlyGlyIle                               330335340                                                                      ACCAACTTTGATAACTTTGCCTTTGCCATGCTTACTGTGTTTCAGTGC1590                           ThrAsnPheAspAsnPheAlaPheAlaMetLeuThrValPheGlnCys                               345350355360                                                                   ATCACCATGGAGGGCTGGACAGACGTGCTCTACTGGATGAATGATGCT1638                           IleThrMetGluGlyTrpThrAspValLeuTyrTrpMetAsnAspAla                               365370375                                                                      ATGGGATTTGAATTGCCCTGGGTGTATTTTGTCAGTCTCGTCATCTTT1686                           MetGlyPheGluLeuProTrpValTyrPheValSerLeuValIlePhe                               380385390                                                                      GGGTCATTTTTCGTACTAAATCTTGTACTTGGTGTATTGAGCGGAGAA1734                           GlySerPhePheValLeuAsnLeuValLeuGlyValLeuSerGlyGlu                               395400405                                                                      TTCTCAAAGGAAAGAGAGAAGGCAAAAGCACGGGGAGATTTCCAGAAG1782                           PheSerLysGluArgGluLysAlaLysAlaArgGlyAspPheGlnLys                               410415420                                                                      CTCCGGGAGAAGCAGCAGCTGGAGGAGGATCTAAAGGGCTACTTGGAT1830                           LeuArgGluLysGlnGlnLeuGluGluAspLeuLysGlyTyrLeuAsp                               425430435440                                                                   TGGATCACCCAAGCTGAGGACATCGATCCGGAGAATGAGGAAGAAGGA1878                           TrpIleThrGlnAlaGluAspIleAspProGluAsnGluGluGluGly                               445450455                                                                      GGAGAGGAAGGCAAACGAAATACTAGCATGCCCACCAGCGAGACTGAG1926                           GlyGluGluGlyLysArgAsnThrSerMetProThrSerGluThrGlu                               460465470                                                                      TCTGTGAACACAGAGAACGTCAGCGGTGAAGGCGAGAACCGAGGCTGC1974                           SerValAsnThrGluAsnValSerGlyGluGlyGluAsnArgGlyCys                               475480485                                                                      TGTGGAAGTCTCTGTCAAGCCATCTCAAAATCCAAACTCAGCCGACGC2022                           CysGlySerLeuCysGlnAlaIleSerLysSerLysLeuSerArgArg                               490495500                                                                      TGGCGTCGCTGGAACCGATTCAATCGCAGAAGATGTAGGGCCGCCGTG2070                           TrpArgArgTrpAsnArgPheAsnArgArgArgCysArgAlaAlaVal                               505510515520                                                                   AAGTCTGTCACGTTTTACTGGCTGGTTATCGTCCTGGTGTTTCTGAAC2118                           LysSerValThrPheTyrTrpLeuValIleValLeuValPheLeuAsn                               525530535                                                                      ACCTTAACCATTTCCTCTGAGCACTACAATCAGCCAGATTGGTTGACA2166                           ThrLeuThrIleSerSerGluHisTyrAsnGlnProAspTrpLeuThr                               540545550                                                                      CAGATTCAAGATATTGCCAACAAAGTCCTCTTGGCTCTGTTCACCTGC2214                           GlnIleGlnAspIleAlaAsnLysValLeuLeuAlaLeuPheThrCys                               555560565                                                                      GAGATGCTGGTAAAAATGTACAGCTTGGGCCTCCAAGCATATTTCGTC2262                           GluMetLeuValLysMetTyrSerLeuGlyLeuGlnAlaTyrPheVal                               570575580                                                                      TCTCTTTTCAACCGGTTTGATTGCTTCGTGGTGTGTGGTGGAATCACT2310                           SerLeuPheAsnArgPheAspCysPheValValCysGlyGlyIleThr                               585590595600                                                                   GAGACGATCTTGGTGGAACTGGAAATCATGTCTCCCCTGGGGATCTCT2358                           GluThrIleLeuValGluLeuGluIleMetSerProLeuGlyIleSer                               605610615                                                                      GTGTTTCGGTGTGTGCGCCTCTTAAGAATCTTCAAAGTGACCAGGCAC2406                           ValPheArgCysValArgLeuLeuArgIlePheLysValThrArgHis                               620625630                                                                      TGGACTTCCCTGAGCAACTTAGTGGCATCCTTATTAAACTCCATGAAG2454                           TrpThrSerLeuSerAsnLeuValAlaSerLeuLeuAsnSerMetLys                               635640645                                                                      TCCATCGCTTCGCTGTTGCTTCTGCTTTTTCTCTTCATTATCATCTTT2502                           SerIleAlaSerLeuLeuLeuLeuLeuPheLeuPheIleIleIlePhe                               650655660                                                                      TCCTTGCTTGGGATGCAGCTGTTTGGCGGCAAGTTTAATTTTGATGAA2550                           SerLeuLeuGlyMetGlnLeuPheGlyGlyLysPheAsnPheAspGlu                               665670675680                                                                   ACGCAAACCAAGCGGAGCACCTTTGACAATTTCCCTCAAGCACTTCTC2598                           ThrGlnThrLysArgSerThrPheAspAsnPheProGlnAlaLeuLeu                               685690695                                                                      ACAGTGTTCCAGATCCTGACAGGCGAAGACTGGAATGCTGTGATGTAC2646                           ThrValPheGlnIleLeuThrGlyGluAspTrpAsnAlaValMetTyr                               700705710                                                                      GATGGCATCATGGCTTACGGGGGCCCATCCTCTTCAGGAATGATCGTC2694                           AspGlyIleMetAlaTyrGlyGlyProSerSerSerGlyMetIleVal                               715720725                                                                      TGCATCTACTTCATCATCCTCTTCATTTGTGGTAACTATATTCTACTG2742                           CysIleTyrPheIleIleLeuPheIleCysGlyAsnTyrIleLeuLeu                               730735740                                                                      AATGTCTTCTTGGCCATCGCTGTAGACAATTTGGCTGATGCTGAAAGT2790                           AsnValPheLeuAlaIleAlaValAspAsnLeuAlaAspAlaGluSer                               745750755760                                                                   CTGAACACTGCTCAGAAAGAAGAAGCGGAAGAAAAGGAGAGGAAAAAG2838                           LeuAsnThrAlaGlnLysGluGluAlaGluGluLysGluArgLysLys                               765770775                                                                      ATTGCCAGAAAAGAGAGCCTAGAAAATAAAAAGAACAACAAACCAGAA2886                           IleAlaArgLysGluSerLeuGluAsnLysLysAsnAsnLysProGlu                               780785790                                                                      GTCAACCAGATAGCCAACAGTGACAACAAGGTTACAATTGATGACTAT2934                           ValAsnGlnIleAlaAsnSerAspAsnLysValThrIleAspAspTyr                               795800805                                                                      AGAGAAGAGGATGAAGACAAGGACCCCTATCCGCCTTGCGATGTGCCA2982                           ArgGluGluAspGluAspLysAspProTyrProProCysAspValPro                               810815820                                                                      GTAGGGGAAGAGGAAGAGGAAGAGGAGGAGGATGAACCTGAGGTTCCT3030                           ValGlyGluGluGluGluGluGluGluGluAspGluProGluValPro                               825830835840                                                                   GCCGGACCCCGTCCTCGAAGGATCTCGGAGTTGAACATGAAGGAAAAA3078                           AlaGlyProArgProArgArgIleSerGluLeuAsnMetLysGluLys                               845850855                                                                      ATTGCCCCCATCCCTGAAGGGAGCGCTTTCTTCATTCTTAGCAAGACC3126                           IleAlaProIleProGluGlySerAlaPhePheIleLeuSerLysThr                               860865870                                                                      AACCCGATCCGCGTAGGCTGCCACAAGCTCATCAACCACCACATCTTC3174                           AsnProIleArgValGlyCysHisLysLeuIleAsnHisHisIlePhe                               875880885                                                                      ACCAACCTCATCCTTGTCTTCATCATGCTGAGCAGTGCTGCCCTGGCC3222                           ThrAsnLeuIleLeuValPheIleMetLeuSerSerAlaAlaLeuAla                               890895900                                                                      GCAGAGGACCCCATCCGCAGCCACTCCTTCCGGAACACGATACTGGGT3270                           AlaGluAspProIleArgSerHisSerPheArgAsnThrIleLeuGly                               905910915920                                                                   TACTTTGACTATGCCTTCACAGCCATCTTTACTGTTGAGATCCTGTTG3318                           TyrPheAspTyrAlaPheThrAlaIlePheThrValGluIleLeuLeu                               925930935                                                                      AAGATGACAACTTTTGGAGCTTTCCTCCACAAAGGGGCCTTCTGCAGG3366                           LysMetThrThrPheGlyAlaPheLeuHisLysGlyAlaPheCysArg                               940945950                                                                      AACTACTTCAATTTGCTGGATATGCTGGTGGTTGGGGTGTCTCTGGTG3414                           AsnTyrPheAsnLeuLeuAspMetLeuValValGlyValSerLeuVal                               955960965                                                                      TCATTTGGGATTCAATCCAGTGCCATCTCCGTTGTGAAGATTCTGAGG3462                           SerPheGlyIleGlnSerSerAlaIleSerValValLysIleLeuArg                               970975980                                                                      GTCTTAAGGGTCCTGCGTCCCCTCAGGGCCATCAACAGAGCAAAAGGA3510                           ValLeuArgValLeuArgProLeuArgAlaIleAsnArgAlaLysGly                               9859909951000                                                                  CTTAAGCACGTGGTCCAGTGCGTCTTCGTGGCCATCCGGACCATCGGC3558                           LeuLysHisValValGlnCysValPheValAlaIleArgThrIleGly                               100510101015                                                                   AACATCATGATCGTCACCACCCTCCTGCAGTTCATGTTTGCCTGTATC3606                           AsnIleMetIleValThrThrLeuLeuGlnPheMetPheAlaCysIle                               102010251030                                                                   GGGGTCCAGTTGTTCAAGGGGAAGTTCTATCGCTGTACGGATGAAGCC3654                           GlyValGlnLeuPheLysGlyLysPheTyrArgCysThrAspGluAla                               103510401045                                                                   AAAAGTAACCCTGAAGAATGCAGGGGACTTTTCATCCTCTACAAGGAT3702                           LysSerAsnProGluGluCysArgGlyLeuPheIleLeuTyrLysAsp                               105010551060                                                                   GGGGATGTTGACAGTCCTGTGGTCCGTGAACGGATCTGGCAAAACAGT3750                           GlyAspValAspSerProValValArgGluArgIleTrpGlnAsnSer                               1065107010751080                                                               GATTTCAACTTCGACAACGTCCTCTCTGCTATGATGGCGCTCTTCACA3798                           AspPheAsnPheAspAsnValLeuSerAlaMetMetAlaLeuPheThr                               108510901095                                                                   GTCTCCACGTTTGAGGGCTGGCCTGCGTTGCTGTATAAAGCCATCGAC3846                           ValSerThrPheGluGlyTrpProAlaLeuLeuTyrLysAlaIleAsp                               110011051110                                                                   TCGAATGGAGAGAACATCGGCCCAATCTACAACCACCGCGTGGAGATC3894                           SerAsnGlyGluAsnIleGlyProIleTyrAsnHisArgValGluIle                               111511201125                                                                   TCCATCTTCTTCATCATCTACATCATCATTGTAGCTTTCTTCATGATG3942                           SerIlePhePheIleIleTyrIleIleIleValAlaPhePheMetMet                               113011351140                                                                   AACATCTTTGTGGGCTTTGTCATCGTTACATTTCAGGAACAAGGAGAA3990                           AsnIlePheValGlyPheValIleValThrPheGlnGluGlnGlyGlu                               1145115011551160                                                               AAAGAGTATAAGAACTGTGAGCTGGACAAAAATCAGCGTCAGTGTGTT4038                           LysGluTyrLysAsnCysGluLeuAspLysAsnGlnArgGlnCysVal                               116511701175                                                                   GAATACGCCTTGAAAGCACGTCCCTTGCGGAGATACATCCCCAAAAAC4086                           GluTyrAlaLeuLysAlaArgProLeuArgArgTyrIleProLysAsn                               118011851190                                                                   CCCTACCAGTACAAGTTCTGGTACGTGGTGAACTCTTCGCCTTTCGAA4134                           ProTyrGlnTyrLysPheTrpTyrValValAsnSerSerProPheGlu                               119512001205                                                                   TACATGATGTTTGTCCTCATCATGCTCAACACACTCTGCTTGGCCATG4182                           TyrMetMetPheValLeuIleMetLeuAsnThrLeuCysLeuAlaMet                               121012151220                                                                   CAGCACTACGAGCAGTCCAAGATGTTCAATGATGCCATGGACATTCTG4230                           GlnHisTyrGluGlnSerLysMetPheAsnAspAlaMetAspIleLeu                               1225123012351240                                                               AACATGGTCTTCACCGGGGTGTTCACCGTCGAGATGGTTTTGAAAGTC4278                           AsnMetValPheThrGlyValPheThrValGluMetValLeuLysVal                               124512501255                                                                   ATCGCATTTAAGCCTAAGGGGTATTTTAGTGACGCCTGGAACACGTTT4326                           IleAlaPheLysProLysGlyTyrPheSerAspAlaTrpAsnThrPhe                               126012651270                                                                   GACTCCCTCATCGTAATCGGCAGCATTATAGACGTGGCCCTCAGCGAA4374                           AspSerLeuIleValIleGlySerIleIleAspValAlaLeuSerGlu                               127512801285                                                                   GCAGACCCAACTGAAAGTGAAAATGTCCCTGTCCCAACTGCTACACCT4422                           AlaAspProThrGluSerGluAsnValProValProThrAlaThrPro                               129012951300                                                                   GGGAACTCTGAAGAGAGCAATAGAATCTCCATCACCTTTTTCCGTCTT4470                           GlyAsnSerGluGluSerAsnArgIleSerIleThrPhePheArgLeu                               1305131013151320                                                               TTCCGAGTGATGCGATTGGTGAAGCTTCTCAGCAGGGGGGAAGGCATC4518                           PheArgValMetArgLeuValLysLeuLeuSerArgGlyGluGlyIle                               132513301335                                                                   CGGACATTGCTGTGGACTTTTATTAAGTTCTTTCAGGCGCTCCCGTAT4566                           ArgThrLeuLeuTrpThrPheIleLysPhePheGlnAlaLeuProTyr                               134013451350                                                                   GTGGCCCTCCTCATAGCCATGCTGTTCTTCATCTATGCGGTCATTGGC4614                           ValAlaLeuLeuIleAlaMetLeuPhePheIleTyrAlaValIleGly                               135513601365                                                                   ATGCAGATGTTTGGGAAAGTTGCCATGAGAGATAACAACCAGATCAAT4662                           MetGlnMetPheGlyLysValAlaMetArgAspAsnAsnGlnIleAsn                               137013751380                                                                   AGGAACAATAACTTCCAGACGTTTCCCCAGGCGGTGCTGCTGCTCTTC4710                           ArgAsnAsnAsnPheGlnThrPheProGlnAlaValLeuLeuLeuPhe                               1385139013951400                                                               AGGTGTGCAACAGGTGAGGCCTGGCAGGAGATCATGCTGGCCTGTCTC4758                           ArgCysAlaThrGlyGluAlaTrpGlnGluIleMetLeuAlaCysLeu                               140514101415                                                                   CCAGGGAAGCTCTGTGACCCTGAGTCAGATTACAACCCCGGGGAGGAG4806                           ProGlyLysLeuCysAspProGluSerAspTyrAsnProGlyGluGlu                               142014251430                                                                   CATACATGTGGGAGCAACTTTGCCATTGTCTATTTCATCAGTTTTTAC4854                           HisThrCysGlySerAsnPheAlaIleValTyrPheIleSerPheTyr                               143514401445                                                                   ATGCTCTGTGCATTTCTGATCATCAATCTGTTTGTGGCTGTCATCATG4902                           MetLeuCysAlaPheLeuIleIleAsnLeuPheValAlaValIleMet                               145014551460                                                                   GATAATTTCGACTATCTGACCCGGGACTGGTCTATTTTGGGGCCTCAC4950                           AspAsnPheAspTyrLeuThrArgAspTrpSerIleLeuGlyProHis                               1465147014751480                                                               CATTTAGATGAATTCAAAAGAATATGGTCAGAATATGACCCTGAGGCA4998                           HisLeuAspGluPheLysArgIleTrpSerGluTyrAspProGluAla                               148514901495                                                                   AAGGGAAGGATAAAACACCTTGATGTGGTCACTCTGCTTCGACGCATC5046                           LysGlyArgIleLysHisLeuAspValValThrLeuLeuArgArgIle                               150015051510                                                                   CAGCCTCCCCTGGGGTTTGGGAAGTTATGTCCACACAGGGTAGCGTGC5094                           GlnProProLeuGlyPheGlyLysLeuCysProHisArgValAlaCys                               151515201525                                                                   AAGAGATTAGTTGCCATGAACATGCCTCTCAACAGTGACGGGACAGTC5142                           LysArgLeuValAlaMetAsnMetProLeuAsnSerAspGlyThrVal                               153015351540                                                                   ATGTTTAATGCAACCCTGTTTGCTTTGGTTCGAACGGCTCTTAAGATC5190                           MetPheAsnAlaThrLeuPheAlaLeuValArgThrAlaLeuLysIle                               1545155015551560                                                               AAGACCGAAGGGAACCTGGAGCAAGCTAATGAAGAACTTCGGGCTGTG5238                           LysThrGluGlyAsnLeuGluGlnAlaAsnGluGluLeuArgAlaVal                               156515701575                                                                   ATAAAGAAAATTTGGAAGAAAACCAGCATGAAATTACTTGACCAAGTT5286                           IleLysLysIleTrpLysLysThrSerMetLysLeuLeuAspGlnVal                               158015851590                                                                   GTCCCTCCAGCTGGTGATGATGAGGTAACCGTGGGGAAGTTCTATGCC5334                           ValProProAlaGlyAspAspGluValThrValGlyLysPheTyrAla                               159516001605                                                                   ACTTTCCTGATACAGGACTACTTTAGGAAATTCAAGAAACGGAAAGAA5382                           ThrPheLeuIleGlnAspTyrPheArgLysPheLysLysArgLysGlu                               161016151620                                                                   CAAGGACTGGTGGGAAAGTACCCTGCGAAGAACACCACAATTGCCCTA5430                           GlnGlyLeuValGlyLysTyrProAlaLysAsnThrThrIleAlaLeu                               1625163016351640                                                               CAGGCGGGATTAAGGACACTGCATGACATTGGGCCAGAAATCCGGCGT5478                           GlnAlaGlyLeuArgThrLeuHisAspIleGlyProGluIleArgArg                               164516501655                                                                   GCTATATCGTGTGATTTGCAAGATGACGAGCCTGAGGAAACAAAACGA5526                           AlaIleSerCysAspLeuGlnAspAspGluProGluGluThrLysArg                               166016651670                                                                   GAAGAAGAAGATGATGTGTTCAAAAGAAATGGTGCCCTGCTTGGAAAC5574                           GluGluGluAspAspValPheLysArgAsnGlyAlaLeuLeuGlyAsn                               167516801685                                                                   CATGTCAATCATGTTAATAGTGATAGGAGAGATTCCCTTCAGCAGACC5622                           HisValAsnHisValAsnSerAspArgArgAspSerLeuGlnGlnThr                               169016951700                                                                   AATACCACCCACCGTCCCCTGCATGTCCAAAGGCCTTCAATTCCACCT5670                           AsnThrThrHisArgProLeuHisValGlnArgProSerIleProPro                               1705171017151720                                                               GCAAGTGATACTGAGAAACCGCTGTTTCCTCCAGCAGGAAATTCGGTG5718                           AlaSerAspThrGluLysProLeuPheProProAlaGlyAsnSerVal                               172517301735                                                                   TGTCATAACCATCATAACCATAATTCCATAGGAAAGCAAGTTCCCACC5766                           CysHisAsnHisHisAsnHisAsnSerIleGlyLysGlnValProThr                               174017451750                                                                   TCAACAAATGCCAATCTCAATAATGCCAATATGTCCAAAGCTGCCCAT5814                           SerThrAsnAlaAsnLeuAsnAsnAlaAsnMetSerLysAlaAlaHis                               175517601765                                                                   GGAAAGCGGCCCAGCATTGGGAACCTTGAGCATGTGTCTGAAAATGGG5862                           GlyLysArgProSerIleGlyAsnLeuGluHisValSerGluAsnGly                               177017751780                                                                   CATCATTCTTCCCACAAGCATGACCGGGAGCCTCAGAGAAGGTCCAGT5910                           HisHisSerSerHisLysHisAspArgGluProGlnArgArgSerSer                               1785179017951800                                                               GTGAAAAGAACCCGCTATTATGAAACTTACATTAGGTCCGACTCAGGA5958                           ValLysArgThrArgTyrTyrGluThrTyrIleArgSerAspSerGly                               180518101815                                                                   GATGAACAGCTCCCAACTATTTGCCGGGAAGACCCAGAGATACATGGC6006                           AspGluGlnLeuProThrIleCysArgGluAspProGluIleHisGly                               182018251830                                                                   TATTTCAGGGACCCCCACTGCTTGGGGGAGCAGGAGTATTTCAGTAGT6054                           TyrPheArgAspProHisCysLeuGlyGluGlnGluTyrPheSerSer                               183518401845                                                                   GAGGAATGCTACGAGGATGACAGCTCGCCCACCTGGAGCAGGCAAAAC6102                           GluGluCysTyrGluAspAspSerSerProThrTrpSerArgGlnAsn                               185018551860                                                                   TATGGCTACTACAGCAGATACCCAGGCAGAAACATCGACTCTGAGAGG6150                           TyrGlyTyrTyrSerArgTyrProGlyArgAsnIleAspSerGluArg                               1865187018751880                                                               CCCCGAGGCTACCATCATCCCCAAGGATTCTTGGAGGACGATGACTCG6198                           ProArgGlyTyrHisHisProGlnGlyPheLeuGluAspAspAspSer                               188518901895                                                                   CCCGTTTGCTATGATTCACGGAGATCTCCAAGGAGACGCCTACTACCT6246                           ProValCysTyrAspSerArgArgSerProArgArgArgLeuLeuPro                               190019051910                                                                   CCCACCCCAGCATCCCACCGGAGATCCTCCTTCAACTTTGAGTGCCTG6294                           ProThrProAlaSerHisArgArgSerSerPheAsnPheGluCysLeu                               191519201925                                                                   CGCCGGCAGAGCAGCCAGGAAGAGGTCCCGTCGTCTCCCATCTTCCCC6342                           ArgArgGlnSerSerGlnGluGluValProSerSerProIlePhePro                               193019351940                                                                   CATCGCACGGCCCTGCCTCTGCATCTAATGCAGCAACAGATCATGGCA6390                           HisArgThrAlaLeuProLeuHisLeuMetGlnGlnGlnIleMetAla                               1945195019551960                                                               GTTGCCGGCCTAGATTCAAGTAAAGCCCAGAAGTACTCACCGAGTCAC6438                           ValAlaGlyLeuAspSerSerLysAlaGlnLysTyrSerProSerHis                               196519701975                                                                   TCGACCCGGTCGTGGGCCACCCCTCCAGCAACCCCTCCCTACCGGGAC6486                           SerThrArgSerTrpAlaThrProProAlaThrProProTyrArgAsp                               198019851990                                                                   TGGACACCGTGCTACACCCCCCTGATCCAAGTGGAGCAGTCAGAGGCC6534                           TrpThrProCysTyrThrProLeuIleGlnValGluGlnSerGluAla                               199520002005                                                                   CTGGACCAGGTGAACGGCAGCCTGCCGTCCCTGCACCGCAGCTCCTGG6582                           LeuAspGlnValAsnGlySerLeuProSerLeuHisArgSerSerTrp                               201020152020                                                                   TACACAGACGAGCCCGACATCTCCTACCGGACTTTCACACCAGCCAGC6630                           TyrThrAspGluProAspIleSerTyrArgThrPheThrProAlaSer                               2025203020352040                                                               CTGACTGTCCCCAGCAGCTTCCGGAACAAAAACAGCGACAAGCAGAGG6678                           LeuThrValProSerSerPheArgAsnLysAsnSerAspLysGlnArg                               204520502055                                                                   AGTGCGGACAGCTTGGTGGAGGCAGTCCTGATATCCGAAGGCTTGGGA6726                           SerAlaAspSerLeuValGluAlaValLeuIleSerGluGlyLeuGly                               206020652070                                                                   CGCTATGCAAGGGACCCAAAATTTGTGTCAGCAACAAAACACGAAATC6774                           ArgTyrAlaArgAspProLysPheValSerAlaThrLysHisGluIle                               207520802085                                                                   GCTGATGCCTGTGACCTCACCATCGACGAGATGGAGAGTGCAGCCAGC6822                           AlaAspAlaCysAspLeuThrIleAspGluMetGluSerAlaAlaSer                               209020952100                                                                   ACCCTGCTTAATGGGAACGTGCGTCCCCGAGCCAACGGGGATGTGGGC6870                           ThrLeuLeuAsnGlyAsnValArgProArgAlaAsnGlyAspValGly                               2105211021152120                                                               CCCCTCTCACACCGGCAGGACTATGAGCTACAGGACTTTGGTCCTGGC6918                           ProLeuSerHisArgGlnAspTyrGluLeuGlnAspPheGlyProGly                               212521302135                                                                   TACAGCGACGAAGAGCCAGACCCTGGGAGGGATGAGGAGGACCTGGCG6966                           TyrSerAspGluGluProAspProGlyArgAspGluGluAspLeuAla                               214021452150                                                                   GATGAAATGATATGCATCACCACCTTGTAGCCCCCAGCGAGGGGCAG7013                            AspGluMetIleCysIleThrThrLeu                                                    21552160                                                                       ACTGGCTCTGGCCTCAGGTGGGGCGCAGGAGAGCCAGGGGAAAAGTGCCTCATAGTTAGG7073               AAAGTTTAGGCACTAGTTGGGAGTAATATTCAATTAATTAGACTTTTGTATAAGAGATGT7133               CATGCCTCAAGAAAGCCATAAACCTGGTAGGAACAGGTCCCAAGCGGTTGAGCCTGGCAG7193               AGTACCATGCGCTCGGCCCCAGCTGCAGGAAACAGCAGGCCCCGCCCTCTCACAGAGGAT7253               GGGTGAGGAGGCCAGACCTGCCCTGCCCCATTGTCCAGATGGGCACTGCTGTGGAGTCTG7313               CTTCTCCCATGTACCAGGGCACCAGGCCCACCCAACTGAAGGCATGGCGGCGGGGTGCAG7373               GGGAAAGTTAAAGGTGATGACGATCATCACACCTGTGTCGTTACCTCAGCCATCGGTCTA7433               GCATATCAGTCACTGGGCCCAACATATCCATTTTTAAACCCTTTCCCCCAAATACACTGC7493               GTCCTGGTTCCTGTTTAGCTGTTCTGAAATACGGTGTGTAAGTAAGTCAGAACCCAGCTA7553               CCAGTGATTATTGCGAGGGCAATGGGACCTCATAAATAAGGTTTTCTGTGATGTGACGCC7613               AGTTTACATAAGAGAATATCAC7635                                                     (2) INFORMATION FOR SEQ ID NO:2:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 104 base pairs                                                     (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 1..102                                                           (ix) FEATURE:                                                                  (A) NAME/KEY: misc_feature                                                     (B) LOCATION: 1..104                                                           (D) OTHER INFORMATION: /note="A 104-nucleotide                                 alternative exon of alpha-1D."                                                 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                        GTAAATGATGCGATAGGATGGGAATGGCCATGGGTGTATTTTGTTAGT48                             ValAsnAspAlaIleGlyTrpGluTrpProTrpValTyrPheValSer                               151015                                                                         CTGATCATCCTTGGCTCATTTTTCGTCCTTAACCTGGTTCTTGGTGTC96                             LeuIleIleLeuGlySerPhePheValLeuAsnLeuValLeuGlyVal                               202530                                                                         CTTAGTGG104                                                                    LeuSer                                                                         (2) INFORMATION FOR SEQ ID NO:3:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 5904 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 1..5904                                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                                        ATGGTCAATGAGAATACGAGGATGTACATTCCAGAGGAAAACCACCAA48                             MetValAsnGluAsnThrArgMetTyrIleProGluGluAsnHisGln                               151015                                                                         GGTTCCAACTATGGGAGCCCACGCCCCGCCCATGCCAACATGAATGCC96                             GlySerAsnTyrGlySerProArgProAlaHisAlaAsnMetAsnAla                               202530                                                                         AATGCGGCAGCGGGGCTGGCCCCTGAGCACATCCCCACCCCGGGGGCT144                            AsnAlaAlaAlaGlyLeuAlaProGluHisIleProThrProGlyAla                               354045                                                                         GCCCTGTCGTGGCAGGCGGCCATCGACGCAGCCCGGCAGGCTAAGCTG192                            AlaLeuSerTrpGlnAlaAlaIleAspAlaAlaArgGlnAlaLysLeu                               505560                                                                         ATGGGCAGCGCTGGCAATGCGACCATCTCCACAGTCAGCTCCACGCAG240                            MetGlySerAlaGlyAsnAlaThrIleSerThrValSerSerThrGln                               65707580                                                                       CGGAAGCGCCAGCAATATGGGAAACCCAAGAAGCAGGGCAGCACCACG288                            ArgLysArgGlnGlnTyrGlyLysProLysLysGlnGlySerThrThr                               859095                                                                         GCCACACGCCCGCCCCGAGCCCTGCTCTGCCTGACCCTGAAGAACCCC336                            AlaThrArgProProArgAlaLeuLeuCysLeuThrLeuLysAsnPro                               100105110                                                                      ATCCGGAGGGCCTGCATCAGCATTGTCGAATGGAAACCATTTGAAATA384                            IleArgArgAlaCysIleSerIleValGluTrpLysProPheGluIle                               115120125                                                                      ATTATTTTACTGACTATTTTTGCCAATTGTGTGGCCTTAGCGATCTAT432                            IleIleLeuLeuThrIlePheAlaAsnCysValAlaLeuAlaIleTyr                               130135140                                                                      ATTCCCTTTCCAGAAGATGATTCCAACGCCACCAATTCCAACCTGGAA480                            IleProPheProGluAspAspSerAsnAlaThrAsnSerAsnLeuGlu                               145150155160                                                                   CGAGTGGAATATCTCTTTCTCATAATTTTTACGGTGGAAGCGTTTTTA528                            ArgValGluTyrLeuPheLeuIleIlePheThrValGluAlaPheLeu                               165170175                                                                      AAAGTAATCGCCTATGGACTCCTCTTTCACCCCAATGCCTACCTCCGC576                            LysValIleAlaTyrGlyLeuLeuPheHisProAsnAlaTyrLeuArg                               180185190                                                                      AACGGCTGGAACCTACTAGATTTTATAATTGTGGTTGTGGGGCTTTTT624                            AsnGlyTrpAsnLeuLeuAspPheIleIleValValValGlyLeuPhe                               195200205                                                                      AGTGCAATTTTAGAACAAGCAACCAAAGCAGATGGGGCAAACGCTCTC672                            SerAlaIleLeuGluGlnAlaThrLysAlaAspGlyAlaAsnAlaLeu                               210215220                                                                      GGAGGGAAAGGGGCCGGATTTGATGTGAAGGCGCTGAGGGCCTTCCGC720                            GlyGlyLysGlyAlaGlyPheAspValLysAlaLeuArgAlaPheArg                               225230235240                                                                   GTGCTGCGCCCCCTGCGGCTGGTGTCCGGAGTCCCAAGTCTCCAGGTG768                            ValLeuArgProLeuArgLeuValSerGlyValProSerLeuGlnVal                               245250255                                                                      GTCCTGAATTCCATCATCAAGGCCATGGTCCCCCTGCTGCACATCGCC816                            ValLeuAsnSerIleIleLysAlaMetValProLeuLeuHisIleAla                               260265270                                                                      CTGCTTGTGCTGTTTGTCATCATCATCTACGCCATCATCGGCTTGGAG864                            LeuLeuValLeuPheValIleIleIleTyrAlaIleIleGlyLeuGlu                               275280285                                                                      CTCTTCATGGGGAAGATGCACAAGACCTGCTACAACCAGGAGGGCATA912                            LeuPheMetGlyLysMetHisLysThrCysTyrAsnGlnGluGlyIle                               290295300                                                                      GCAGATGTTCCAGCAGAAGATGACCCTTCCCCTTGTGCGCTGGAAACG960                            AlaAspValProAlaGluAspAspProSerProCysAlaLeuGluThr                               305310315320                                                                   GGCCACGGGCGGCAGTGCCAGAACGGCACGGTGTGCAAGCCCGGCTGG1008                           GlyHisGlyArgGlnCysGlnAsnGlyThrValCysLysProGlyTrp                               325330335                                                                      GATGGTCCCAAGCACGGCATCACCAACTTTGACAACTTTGCCTTCGCC1056                           AspGlyProLysHisGlyIleThrAsnPheAspAsnPheAlaPheAla                               340345350                                                                      ATGCTCACGGTGTTCCAGTGCATCACCATGGAGGGCTGGACGGACGTG1104                           MetLeuThrValPheGlnCysIleThrMetGluGlyTrpThrAspVal                               355360365                                                                      CTGTACTGGGTCAATGATGCCGTAGGAAGGGACTGGCCCTGGATCTAT1152                           LeuTyrTrpValAsnAspAlaValGlyArgAspTrpProTrpIleTyr                               370375380                                                                      TTTGTTACACTAATCATCATAGGGTCATTTTTTGTACTTAACTTGGTT1200                           PheValThrLeuIleIleIleGlySerPhePheValLeuAsnLeuVal                               385390395400                                                                   CTCGGTGTGCTTAGCGGAGAGTTTTCCAAAGAGAGGGAGAAGGCCAAG1248                           LeuGlyValLeuSerGlyGluPheSerLysGluArgGluLysAlaLys                               405410415                                                                      GCCCGGGGAGATTTCCAGAAGCTGCGGGAGAAGCAGCAGCTAGAAGAG1296                           AlaArgGlyAspPheGlnLysLeuArgGluLysGlnGlnLeuGluGlu                               420425430                                                                      GATCTCAAAGGCTACCTGGATTGGATCACTCAGGCCGAAGACATCGNT1344                           AspLeuLysGlyTyrLeuAspTrpIleThrGlnAlaGluAspIleXaa                               435440445                                                                      CCTGAGAATGAGGACGAAGGCATGGATGAGGAGAAGCCCCGAAACAGA1392                           ProGluAsnGluAspGluGlyMetAspGluGluLysProArgAsnArg                               450455460                                                                      GGCACTCCGGCGGGCATGCTTGATCAGAAGAAAGGGAAGTTTGCTTGG1440                           GlyThrProAlaGlyMetLeuAspGlnLysLysGlyLysPheAlaTrp                               465470475480                                                                   TTTAGTCACTCCACAGAAACCCATGTGAGCATGCCCACCAGTGAGACC1488                           PheSerHisSerThrGluThrHisValSerMetProThrSerGluThr                               485490495                                                                      GAGTCCGTCAACACCGAAAACGTGGCTGGAGGTGACATCGAGGGAGAA1536                           GluSerValAsnThrGluAsnValAlaGlyGlyAspIleGluGlyGlu                               500505510                                                                      AACTGCGGGGCCAGGCTGGCCCACCGGATCTCCAAGTCAAAGTTCAGC1584                           AsnCysGlyAlaArgLeuAlaHisArgIleSerLysSerLysPheSer                               515520525                                                                      CGCTACTGGCGCCGGTGGAATCGGTTCTGCAGAAGGAAGTGCCGCGCC1632                           ArgTyrTrpArgArgTrpAsnArgPheCysArgArgLysCysArgAla                               530535540                                                                      GCAGTCAAGTCTAATGTCTTCTACTGGCTGGTGATTTTCCTGGTGTTC1680                           AlaValLysSerAsnValPheTyrTrpLeuValIlePheLeuValPhe                               545550555560                                                                   CTCAACACGCTCACCATTGCCTCTGAGCACTACAACCAGCCCAACTGG1728                           LeuAsnThrLeuThrIleAlaSerGluHisTyrAsnGlnProAsnTrp                               565570575                                                                      CTCACAGAAGTCCAAGACACGGCAAACAAGGCCCTGCTGGCCCTGTTC1776                           LeuThrGluValGlnAspThrAlaAsnLysAlaLeuLeuAlaLeuPhe                               580585590                                                                      ACGGCAGAGATGCTCCTGAAGATGTACAGCCTGGGCCTGCAGGCCTAC1824                           ThrAlaGluMetLeuLeuLysMetTyrSerLeuGlyLeuGlnAlaTyr                               595600605                                                                      TTCGTGTCCCTCTTCAACCGCTTTGACTGCTTCGTCGTGTGTGGCGGC1872                           PheValSerLeuPheAsnArgPheAspCysPheValValCysGlyGly                               610615620                                                                      ATCCTGGAGACCATCCTGGTGGAGACCAAGATCATGTCCCCACTGGGC1920                           IleLeuGluThrIleLeuValGluThrLysIleMetSerProLeuGly                               625630635640                                                                   ATCTCCGTGCTCAGATGCGTCCGGCTGCTGAGGATTTTCAAGATCACG1968                           IleSerValLeuArgCysValArgLeuLeuArgIlePheLysIleThr                               645650655                                                                      AGGTACTGGAACTCCTTGAGCAACCTGGTGGCATCCTTGCTGAACTCT2016                           ArgTyrTrpAsnSerLeuSerAsnLeuValAlaSerLeuLeuAsnSer                               660665670                                                                      GTGCGCTCCATCGCCTCCCTGCTCCTTCTCCTCTTCCTCTTCATCATC2064                           ValArgSerIleAlaSerLeuLeuLeuLeuLeuPheLeuPheIleIle                               675680685                                                                      ATCTTCTCCCTCCTGGGGATGCAGCTCTTTGGAGGAAAGTTCAACTTT2112                           IlePheSerLeuLeuGlyMetGlnLeuPheGlyGlyLysPheAsnPhe                               690695700                                                                      GATGAGATGCAGACCCGGAGGAGCACATTCGATAACTTCCCCCAGTCC2160                           AspGluMetGlnThrArgArgSerThrPheAspAsnPheProGlnSer                               705710715720                                                                   CTCCTCACTGTGTTTCAGATCCTGACCGGGGAGGACTGGAATTCGGTG2208                           LeuLeuThrValPheGlnIleLeuThrGlyGluAspTrpAsnSerVal                               725730735                                                                      ATGTATGATGGGATCATGGCTTATGGGGGCCCCTCTTTTCCAGGGATG2256                           MetTyrAspGlyIleMetAlaTyrGlyGlyProSerPheProGlyMet                               740745750                                                                      TTAGTCTGTATTTACTTCATCATCCTCTTCATCTCTGGAAACTATATC2304                           LeuValCysIleTyrPheIleIleLeuPheIleSerGlyAsnTyrIle                               755760765                                                                      CTACTGAATGTGTTCTTGGCCATTGCTGTGGACAACCTGGCTGATGCT2352                           LeuLeuAsnValPheLeuAlaIleAlaValAspAsnLeuAlaAspAla                               770775780                                                                      GAGAGCCTCACATCTGCCCTAAAGGAGGAGGAAGAGGAGAAGGAGAGA2400                           GluSerLeuThrSerAlaLeuLysGluGluGluGluGluLysGluArg                               785790795800                                                                   AAGAAGCTGGCCAGGACTGCCAGCCCAGAGAAGAAACAAGAGTTGGTG2448                           LysLysLeuAlaArgThrAlaSerProGluLysLysGlnGluLeuVal                               805810815                                                                      GAGAAGCCGGCAGTGGGGGAATCCAAGGAGGAGAAGATTGAGCTGAAA2496                           GluLysProAlaValGlyGluSerLysGluGluLysIleGluLeuLys                               820825830                                                                      TCCATCACGGCTGACGGAGAGTCTCCACCCGCCACCAAGATCAACATG2544                           SerIleThrAlaAspGlyGluSerProProAlaThrLysIleAsnMet                               835840845                                                                      GATGACCTCCAGCCCAATGAAAATGAGGATAAGAGCCCCTACCCCAAC2592                           AspAspLeuGlnProAsnGluAsnGluAspLysSerProTyrProAsn                               850855860                                                                      CCAGAAACTACAGGAGAAGAGGATGAGGAGGAGCCAGAGATGCCTGTC2640                           ProGluThrThrGlyGluGluAspGluGluGluProGluMetProVal                               865870875880                                                                   GGCCCTCGCCCACGACCACTCTCTGAGCTTCACCTTAAGGAAAAGGCA2688                           GlyProArgProArgProLeuSerGluLeuHisLeuLysGluLysAla                               885890895                                                                      GTGCCCATGCCAGAAGCCAGCGCGTTTTTCATCTTCAGCTCTAACAAC2736                           ValProMetProGluAlaSerAlaPhePheIlePheSerSerAsnAsn                               900905910                                                                      AGGTTTCGCCTCCAGTGCCACCGCATTGTCAATGACACGATCTTCACC2784                           ArgPheArgLeuGlnCysHisArgIleValAsnAspThrIlePheThr                               915920925                                                                      AACCTGATCCTCTTCTTCATTCTGCTCAGCAGCATTTCCCTGGCTGCT2832                           AsnLeuIleLeuPhePheIleLeuLeuSerSerIleSerLeuAlaAla                               930935940                                                                      GAGGACCCGGTCCAGCACACCTCCTTCAGGAACCATATTCTGTTTTAT2880                           GluAspProValGlnHisThrSerPheArgAsnHisIleLeuPheTyr                               945950955960                                                                   TTTGATATTGTTTTTACCACCATTTTCACCATTGAAATTGCTCTGAAG2928                           PheAspIleValPheThrThrIlePheThrIleGluIleAlaLeuLys                               965970975                                                                      ATGACTGCTTATGGGGCTTTCTTGCACAAGGGTTCTTTCTGCCGGAAC2976                           MetThrAlaTyrGlyAlaPheLeuHisLysGlySerPheCysArgAsn                               980985990                                                                      TACTTCAACATCCTGGACCTGCTGGTGGTCAGCGTGTCCCTCATCTCC3024                           TyrPheAsnIleLeuAspLeuLeuValValSerValSerLeuIleSer                               99510001005                                                                    TTTGGCATCCAGTCCAGTGCAATCAATGTCGTGAAGATCTTGCGAGTC3072                           PheGlyIleGlnSerSerAlaIleAsnValValLysIleLeuArgVal                               101010151020                                                                   CTGCGAGTACTCAGGCCCCTGAGGGCCATCAACAGGGCCAAGGGGCTA3120                           LeuArgValLeuArgProLeuArgAlaIleAsnArgAlaLysGlyLeu                               1025103010351040                                                               AAGCATGTGGTTCAGTGTGTGTTTGTCGCCATCCGGACCATCGGGAAC3168                           LysHisValValGlnCysValPheValAlaIleArgThrIleGlyAsn                               104510501055                                                                   ATCGTGATTGTCACCACCCTGCTGCAGTTCATGTTTGCCTGCATCGGG3216                           IleValIleValThrThrLeuLeuGlnPheMetPheAlaCysIleGly                               106010651070                                                                   GTCCAGCTCTTCAAGGGAAAGCTGTACACCTGTTCAGACAGTTCCAAG3264                           ValGlnLeuPheLysGlyLysLeuTyrThrCysSerAspSerSerLys                               107510801085                                                                   CAGACAGAGGCGGAATGCAAGGGCAACTACATCACGTACAAAGACGGG3312                           GlnThrGluAlaGluCysLysGlyAsnTyrIleThrTyrLysAspGly                               109010951100                                                                   GAGGTTGACCACCCCATCATCCAACCCCGCAGCTGGGAGAACAGCAAG3360                           GluValAspHisProIleIleGlnProArgSerTrpGluAsnSerLys                               1105111011151120                                                               TTTGACTTTGACAATGTTCTGGCAGCCATGATGGCCCTCTTCACCGTC3408                           PheAspPheAspAsnValLeuAlaAlaMetMetAlaLeuPheThrVal                               112511301135                                                                   TCCACCTTCGAAGGGTGGCCAGAGCTGCTGTACCGCTCCATCGACTCC3456                           SerThrPheGluGlyTrpProGluLeuLeuTyrArgSerIleAspSer                               114011451150                                                                   CACACGGAAGACAAGGGCCCCATCTACAACTACCGTGTGGAGATCTCC3504                           HisThrGluAspLysGlyProIleTyrAsnTyrArgValGluIleSer                               115511601165                                                                   ATCTTCTTCATCATCTACATCATCATCATCGCCTTCTTCATGATGAAC3552                           IlePhePheIleIleTyrIleIleIleIleAlaPhePheMetMetAsn                               117011751180                                                                   ATCTTCGTGGGCTTCGTCATCGTCACCTTTCAGGAGCAGGGGGAGCAG3600                           IlePheValGlyPheValIleValThrPheGlnGluGlnGlyGluGln                               1185119011951200                                                               GAGTACAAGAACTGTGAGCTGGACAAGAACCAGCGACAGTGCGTGGAA3648                           GluTyrLysAsnCysGluLeuAspLysAsnGlnArgGlnCysValGlu                               120512101215                                                                   TACGCCCTCAAGGCCCGGCCCCTGCGGAGGTACATCCCCAAGAACCAG3696                           TyrAlaLeuLysAlaArgProLeuArgArgTyrIleProLysAsnGln                               122012251230                                                                   CACCAGTACAAAGTGTGGTACGTGGTCAACTCCACCTACTTCGAGTAC3744                           HisGlnTyrLysValTrpTyrValValAsnSerThrTyrPheGluTyr                               123512401245                                                                   CTGATGTTCGTCCTCATCCTGCTCAACACCATCTGCCTGGCCATGCAG3792                           LeuMetPheValLeuIleLeuLeuAsnThrIleCysLeuAlaMetGln                               125012551260                                                                   CACTACGGCCAGAGCTGCCTGTTCAAAATCGCCATGAACATCCTCAAC3840                           HisTyrGlyGlnSerCysLeuPheLysIleAlaMetAsnIleLeuAsn                               1265127012751280                                                               ATGCTCTTCACTGGCCTCTTCACCGTGGAGATGATCCTGAAGCTCATT3888                           MetLeuPheThrGlyLeuPheThrValGluMetIleLeuLysLeuIle                               128512901295                                                                   GCCTTCAAACCCAAGGGTTACTTTAGTGATCCCTGGAATGTTTTTGAC3936                           AlaPheLysProLysGlyTyrPheSerAspProTrpAsnValPheAsp                               130013051310                                                                   TTCCTCATCGTAATTGGCAGCATAATTGACGTCATTCTCAGTGAGACT3984                           PheLeuIleValIleGlySerIleIleAspValIleLeuSerGluThr                               131513201325                                                                   AATCCAGCTGAACATACCCAATGCTCTCCCTCTATGAACGCAGAGGAA4032                           AsnProAlaGluHisThrGlnCysSerProSerMetAsnAlaGluGlu                               133013351340                                                                   AACTCCCGCATCTCCATCACCTTCTTCCGCCTGTTCCGGGTCATGCGT4080                           AsnSerArgIleSerIleThrPhePheArgLeuPheArgValMetArg                               1345135013551360                                                               CTGGTGAAGCTGCTGAGCCGTGGGGAGGGCATCCGGACGCTGCTGTGG4128                           LeuValLysLeuLeuSerArgGlyGluGlyIleArgThrLeuLeuTrp                               136513701375                                                                   ACCTTCATCAAGTCCTTCCAGGCCCTGCCCTATGTGGCCCTCCTGATC4176                           ThrPheIleLysSerPheGlnAlaLeuProTyrValAlaLeuLeuIle                               138013851390                                                                   GTGATGCTGTTCTTCATCTACGCGGTGATCGGGATGCAGGTGTTTGGG4224                           ValMetLeuPhePheIleTyrAlaValIleGlyMetGlnValPheGly                               139514001405                                                                   AAAATTGCCCTGAATGATACCACAGAGATCAACCGGAACAACAACTTT4272                           LysIleAlaLeuAsnAspThrThrGluIleAsnArgAsnAsnAsnPhe                               141014151420                                                                   CAGACCTTCCCCCAGGCCGTGCTGCTCCTCTTCAGGTGTGCCACCGGG4320                           GlnThrPheProGlnAlaValLeuLeuLeuPheArgCysAlaThrGly                               1425143014351440                                                               GAGGCCTGGCAGGACATCATGCTGGCCTGCATGCCAGGCAAGAAGTGT4368                           GluAlaTrpGlnAspIleMetLeuAlaCysMetProGlyLysLysCys                               144514501455                                                                   GCCCCAGAGTCCGAGCCCAGCAACAGCACGGAGGGTGAAACACCCTGT4416                           AlaProGluSerGluProSerAsnSerThrGluGlyGluThrProCys                               146014651470                                                                   GGTAGCAGCTTTGCTGTCTTCTACTTCATCAGCTTCTACATGCGCTGT4464                           GlySerSerPheAlaValPheTyrPheIleSerPheTyrMetArgCys                               147514801485                                                                   GCCTTCCTGATCATCAACCTCTTTGTAGCTGTCATCATGGACAACTTT4512                           AlaPheLeuIleIleAsnLeuPheValAlaValIleMetAspAsnPhe                               149014951500                                                                   GACTACCTGACAAGGGACTGGTCCATCCTTGGTCCCCACCACCTGGAT4560                           AspTyrLeuThrArgAspTrpSerIleLeuGlyProHisHisLeuAsp                               1505151015151520                                                               GAGTTTAAAAGAATCTGGGCAGAGTATGACCCTGAAGCCAAGGGTCGT4608                           GluPheLysArgIleTrpAlaGluTyrAspProGluAlaLysGlyArg                               152515301535                                                                   ATCAAACACCTGGATGTGGTGACCCTCCTCCGGCGGATTCAGCCGCCA4656                           IleLysHisLeuAspValValThrLeuLeuArgArgIleGlnProPro                               154015451550                                                                   CTAGGTTTTGGGAAGCTGTGCCCTCACCGCGTGGCTTGCAAACGCCTG4704                           LeuGlyPheGlyLysLeuCysProHisArgValAlaCysLysArgLeu                               155515601565                                                                   GTCTCCATGAACATGCCTCTGAACAGCGACGGGACAGTCATGTTCAAT4752                           ValSerMetAsnMetProLeuAsnSerAspGlyThrValMetPheAsn                               157015751580                                                                   GCCACCCTGTTTGCCCTGGTCAGGACGGCCCTGAGGATCAAAACAGAA4800                           AlaThrLeuPheAlaLeuValArgThrAlaLeuArgIleLysThrGlu                               1585159015951600                                                               GGGAACCTAGAACAAGCCAATGAGGAGCTGCGGGCGATCATCAAGAAG4848                           GlyAsnLeuGluGlnAlaAsnGluGluLeuArgAlaIleIleLysLys                               160516101615                                                                   ATCTGGAAGCGGACCAGCATGAAGCTGCTGGACCAGGTGGTGCCCCCT4896                           IleTrpLysArgThrSerMetLysLeuLeuAspGlnValValProPro                               162016251630                                                                   GCAGGTGATGATGAGGTCACCGTTGGCAAGTTCTACGCCACGTTCCTG4944                           AlaGlyAspAspGluValThrValGlyLysPheTyrAlaThrPheLeu                               163516401645                                                                   ATCCAGGAGTACTTCCGGAAGTTCAAGAAGCGCAAAGAGCAGGGCCTT4992                           IleGlnGluTyrPheArgLysPheLysLysArgLysGluGlnGlyLeu                               165016551660                                                                   GTGGGCAAGCCCTCCCAGAGGAACGCGCTGTCTCTGCAGGCTGGCTTG5040                           ValGlyLysProSerGlnArgAsnAlaLeuSerLeuGlnAlaGlyLeu                               1665167016751680                                                               CGCACACTGCATGACATCGGGCCTGAGATCCGACGGGCCATCTCTGGA5088                           ArgThrLeuHisAspIleGlyProGluIleArgArgAlaIleSerGly                               168516901695                                                                   GATCTCACCGCTGAGGAGGAGCTGGACAAGGCCATGAAGGAGGCTGTG5136                           AspLeuThrAlaGluGluGluLeuAspLysAlaMetLysGluAlaVal                               170017051710                                                                   TCCGCTGCTTCTGAAGATGACATCTTCAGGAGGGCCGGTGGCCTGTTC5184                           SerAlaAlaSerGluAspAspIlePheArgArgAlaGlyGlyLeuPhe                               171517201725                                                                   GGCAACCACGTCAGCTACTACCAAAGCGACGGCCGGAGCGCCTTCCCC5232                           GlyAsnHisValSerTyrTyrGlnSerAspGlyArgSerAlaPhePro                               173017351740                                                                   CAGACCTTCACCACTCAGCGCCCGCTGCACATCAACAAGGCGGGCAGC5280                           GlnThrPheThrThrGlnArgProLeuHisIleAsnLysAlaGlySer                               1745175017551760                                                               AGCCAGGGCGACACTGAGTCGCCATCCCACGAGAAGCTGGTGGACTCC5328                           SerGlnGlyAspThrGluSerProSerHisGluLysLeuValAspSer                               176517701775                                                                   ACCTTCACCCCGAGCAGCTACTCGTCCACCGGCTCCAACGCCAACATC5376                           ThrPheThrProSerSerTyrSerSerThrGlySerAsnAlaAsnIle                               178017851790                                                                   AACAACGCCAACAACACCGCCCTGGGTCGCCTCCCTCGCCCCGCCGGC5424                           AsnAsnAlaAsnAsnThrAlaLeuGlyArgLeuProArgProAlaGly                               179518001805                                                                   TACCCCAGCACAGTCAGCACTGTGGAGGGCCACGGGCCCCCCTTGTCC5472                           TyrProSerThrValSerThrValGluGlyHisGlyProProLeuSer                               181018151820                                                                   CCTGCCATCCGGGTGCAGGAGGTGGCGTGGAAGCTCAGCTCCAACAGG5520                           ProAlaIleArgValGlnGluValAlaTrpLysLeuSerSerAsnArg                               1825183018351840                                                               TGCCACTCCCGGGAGAGCCAGGCAGCCATGGCGCGTCAGGAGGAGACG5568                           CysHisSerArgGluSerGlnAlaAlaMetAlaArgGlnGluGluThr                               184518501855                                                                   TCTCAGGATGAGACCTATGAAGTGAAGATGAACCATGACACGGAGGCC5616                           SerGlnAspGluThrTyrGluValLysMetAsnHisAspThrGluAla                               186018651870                                                                   TGCAGTGAGCCCAGCCTGCTCTCCACAGAGATGCTCTCCTACCAGGAT5664                           CysSerGluProSerLeuLeuSerThrGluMetLeuSerTyrGlnAsp                               187518801885                                                                   GACGAAAATCGGCAACTGACGCTCCCAGAGGAGGACAAGAGGGACATC5712                           AspGluAsnArgGlnLeuThrLeuProGluGluAspLysArgAspIle                               189018951900                                                                   CGGCAATCTCCGAAGAGGGGTTTCCTCCGCTCTTCCTCACTAGGTCGA5760                           ArgGlnSerProLysArgGlyPheLeuArgSerSerSerLeuGlyArg                               1905191019151920                                                               AGGGCCTCCTTCCACCTGGAATGTCTGAAGCGACAGAAGGACCGAGGG5808                           ArgAlaSerPheHisLeuGluCysLeuLysArgGlnLysAspArgGly                               192519301935                                                                   GGAGACATCTCTCAGAAGACAGTCCTGCCCTTGCATCTGGTTCATCAT5856                           GlyAspIleSerGlnLysThrValLeuProLeuHisLeuValHisHis                               194019451950                                                                   CAGGCATTGGCAGTGGCAGGCCTGAGCCCCCTCCTCCAGAGAAGCCAT5904                           GlnAlaLeuAlaValAlaGlyLeuSerProLeuLeuGlnArgSerHis                               195519601965                                                                   (2) INFORMATION FOR SEQ ID NO:4:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 132 base pairs                                                     (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:                                        AGACCACGGCTTCCTCGAATCTTGCGCGAAGCCGCCGGCCTCGGAGGAGGGATTAATCCA60                 GACCCGCCGGGGGGTGTTTTCACATTTCTTCCTCTTCGTGGCTGCTCCTCCTATTAAAAC120                CATTTTTGGTCC132                                                                (2) INFORMATION FOR SEQ ID NO:5:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 89 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:                                        CGCTGAGGGCCTTCCGCGTGCTGCGCCCCCTGCGGCTGGTGTCCGGAGTCCCAAGTCTCC60                 AGGTGGTCCTGAATTCCATCATCAAGGCC89                                                (2) INFORMATION FOR SEQ ID NO:6:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 84 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 1..84                                                            (D) OTHER INFORMATION: /note="An alternative exon of                           alpha-1C."                                                                     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                                        CACTATTTCTGTGATGCATGGAATACATTTGACGCCTTGATTGTTGTG48                             HisTyrPheCysAspAlaTrpAsnThrPheAspAlaLeuIleValVal                               151015                                                                         GGTAGCATTGTTGATATAGCAATCACCGAGGTAAAC84                                         GlySerIleValAspIleAlaIleThrGluValAsn                                           2025                                                                           (2) INFORMATION FOR SEQ ID NO:7:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 7362 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 144..7163                                                        (ix) FEATURE:                                                                  (A) NAME/KEY: 5'UTR                                                            (B) LOCATION: 1..143                                                           (ix) FEATURE:                                                                  (A) NAME/KEY: 3'UTR                                                            (B) LOCATION: 7161..7362                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:                                        GCGGCGGCGGCTGCGGCGGTGGGGCCGGGCGAGGTCCGTGCGGTCCCGGCGGCTCCGTGG60                 CTGCTCCGCTCTGAGCGCCTGCGCGCCCCGCGCCCTCCCTGCCGGGGCCGCTGGGCCGGG120                GATGCACGCGGGGCCCGGGAGCCATGGTCCGCTTCGGGGACGAGCTGGGC170                          MetValArgPheGlyAspGluLeuGly                                                    15                                                                             GGCCGCTATGGAGGCCCCGGCGGCGGAGAGCGGGCCCGGGGCGGCGGG218                            GlyArgTyrGlyGlyProGlyGlyGlyGluArgAlaArgGlyGlyGly                               10152025                                                                       GCCGGCGGGGCGGGGGGCCCGGGTCCCGGGGGGCTGCAGCCCGGCCAG266                            AlaGlyGlyAlaGlyGlyProGlyProGlyGlyLeuGlnProGlyGln                               303540                                                                         CGGGTCCTCTACAAGCAATCGATCGCGCAGCGCGCGCGGACCATGGCG314                            ArgValLeuTyrLysGlnSerIleAlaGlnArgAlaArgThrMetAla                               455055                                                                         CTGTACAACCCCATCCCGGTCAAGCAGAACTGCTTCACCGTCAACCGC362                            LeuTyrAsnProIleProValLysGlnAsnCysPheThrValAsnArg                               606570                                                                         TCGCTCTTCGTCTTCAGCGAGGACAACGTCGTCCGCAAATACGCGAAG410                            SerLeuPheValPheSerGluAspAsnValValArgLysTyrAlaLys                               758085                                                                         CGCATCACCGAGTGGCCTCCATTCGAGAATATGATCCTGGCCACCATC458                            ArgIleThrGluTrpProProPheGluAsnMetIleLeuAlaThrIle                               9095100105                                                                     ATCGCCAACTGCATCGTGCTGGCCCTGGAGCAGCACCTCCCTGATGGG506                            IleAlaAsnCysIleValLeuAlaLeuGluGlnHisLeuProAspGly                               110115120                                                                      GACAAAACGCCCATGTCCGAGCGGCTGGACGACACGGAGCCCTATTTC554                            AspLysThrProMetSerGluArgLeuAspAspThrGluProTyrPhe                               125130135                                                                      ATCGGGATCTTTTGCTTCGAGGCAGGGATCAAAATCATCGCTCTGGGC602                            IleGlyIlePheCysPheGluAlaGlyIleLysIleIleAlaLeuGly                               140145150                                                                      TTTGTCTTCCACAAGGGCTCTTACCTGCGGAACGGCTGGAACGTCATG650                            PheValPheHisLysGlySerTyrLeuArgAsnGlyTrpAsnValMet                               155160165                                                                      GACTTCGTGGTCGTCCTCACAGGGATCCTTGCCACGGCTGGAACTGAC698                            AspPheValValValLeuThrGlyIleLeuAlaThrAlaGlyThrAsp                               170175180185                                                                   TTCGACCTGCGAACACTGAGGGCTGTGCGTGTGCTGAGGCCCCTGAAG746                            PheAspLeuArgThrLeuArgAlaValArgValLeuArgProLeuLys                               190195200                                                                      CTGGTGTCTGGGATTCCAAGTTTGCAGGTGGTGCTCAAGTCCATCATG794                            LeuValSerGlyIleProSerLeuGlnValValLeuLysSerIleMet                               205210215                                                                      AAGGCCATGGTTCCACTCCTGCAGATTGGGCTGCTTCTCTTCTTTGCC842                            LysAlaMetValProLeuLeuGlnIleGlyLeuLeuLeuPhePheAla                               220225230                                                                      ATCCTCATGTTTGCCATCATTGGCCTGGAGTTCTACATGGGCAAGTTC890                            IleLeuMetPheAlaIleIleGlyLeuGluPheTyrMetGlyLysPhe                               235240245                                                                      CACAAGGCCTGTTTCCCCAACAGCACAGATGCGGAGCCCGTGGGTGAC938                            HisLysAlaCysPheProAsnSerThrAspAlaGluProValGlyAsp                               250255260265                                                                   TTCCCCTGTGGCAAGGAGGCCCCAGCCCGGCTGTGCGAGGGCGACACT986                            PheProCysGlyLysGluAlaProAlaArgLeuCysGluGlyAspThr                               270275280                                                                      GAGTGCCGGGAGTACTGGCCAGGACCCAACTTTGGCATCACCAACTTT1034                           GluCysArgGluTyrTrpProGlyProAsnPheGlyIleThrAsnPhe                               285290295                                                                      GACAATATCCTGTTTGCCATCTTGACGGTGTTCCAGTGCATCACCATG1082                           AspAsnIleLeuPheAlaIleLeuThrValPheGlnCysIleThrMet                               300305310                                                                      GAGGGCTGGACTGACATCCTCTATAATACAAACGATGCGGCCGGCAAC1130                           GluGlyTrpThrAspIleLeuTyrAsnThrAsnAspAlaAlaGlyAsn                               315320325                                                                      ACCTGGAACTGGCTCTACTTCATCCCTCTCATCATCATCGGCTCCTTC1178                           ThrTrpAsnTrpLeuTyrPheIleProLeuIleIleIleGlySerPhe                               330335340345                                                                   TTCATGCTCAACCTGGTGCTGGGCGTGCTCTCGGGGGAGTTTGCCAAG1226                           PheMetLeuAsnLeuValLeuGlyValLeuSerGlyGluPheAlaLys                               350355360                                                                      GAGCGAGAGAGGGTGGAGAACCGCCGCGCCTTCCTGAAGCTGCGCCGG1274                           GluArgGluArgValGluAsnArgArgAlaPheLeuLysLeuArgArg                               365370375                                                                      CAGCAGCAGATCGAGCGAGAGCTCAACGGGTACCTGGAGTGGATCTTC1322                           GlnGlnGlnIleGluArgGluLeuAsnGlyTyrLeuGluTrpIlePhe                               380385390                                                                      AAGGCGGAGGAAGTCATGCTGGCCGAGGAGGACAGGAATGCAGAGGAG1370                           LysAlaGluGluValMetLeuAlaGluGluAspArgAsnAlaGluGlu                               395400405                                                                      AAGTCCCCTTTGGACGTGCTGAAGAGAGCGGCCACCAAGAAGAGCAGA1418                           LysSerProLeuAspValLeuLysArgAlaAlaThrLysLysSerArg                               410415420425                                                                   AATGACCTGATCCACGCAGAGGAGGGAGAGGACCGGTTTGCAGATCTC1466                           AsnAspLeuIleHisAlaGluGluGlyGluAspArgPheAlaAspLeu                               430435440                                                                      TGTGCTGTTGGATCCCCCTTCGCCCGCGCCAGCCTCAAGAGCGGGAAG1514                           CysAlaValGlySerProPheAlaArgAlaSerLeuLysSerGlyLys                               445450455                                                                      ACAGAGAGCTCGTCATACTTCCGGAGGAAGGAGAAGATGTTCCGGTTT1562                           ThrGluSerSerSerTyrPheArgArgLysGluLysMetPheArgPhe                               460465470                                                                      TTTATCCGGCGCATGGTGAAGGCTCAGAGCTTCTACTGGGTGGTGCTG1610                           PheIleArgArgMetValLysAlaGlnSerPheTyrTrpValValLeu                               475480485                                                                      TGCGTGGTGGCCCTGAACACACTGTGTGTGGCCATGGTGCATTACAAC1658                           CysValValAlaLeuAsnThrLeuCysValAlaMetValHisTyrAsn                               490495500505                                                                   CAGCCGCGGCGGCTTACCACGACCCTGTATTTTGCAGAGTTTGTTTTC1706                           GlnProArgArgLeuThrThrThrLeuTyrPheAlaGluPheValPhe                               510515520                                                                      CTGGGTCTCTTCCTCACAGAGATGTCCCTGAAGATGTATGGCCTGGGG1754                           LeuGlyLeuPheLeuThrGluMetSerLeuLysMetTyrGlyLeuGly                               525530535                                                                      CCCAGAAGCTACTTCCGGTCCTCCTTCAACTGCTTCGACTTTGGGGTC1802                           ProArgSerTyrPheArgSerSerPheAsnCysPheAspPheGlyVal                               540545550                                                                      ATCGTGGGGAGCGTCTTTGAAGTGGTCTGGGCGGCCATCAAGCCGGGA1850                           IleValGlySerValPheGluValValTrpAlaAlaIleLysProGly                               555560565                                                                      AGCTCCTTTGGGATCAGTGTGCTGCGGGCCCTCCGCCTGCTGAGGATC1898                           SerSerPheGlyIleSerValLeuArgAlaLeuArgLeuLeuArgIle                               570575580585                                                                   TTCAAAGTCACGAAGTACTGGAGCTCCCTGCGGAACCTGGTGGTGTCC1946                           PheLysValThrLysTyrTrpSerSerLeuArgAsnLeuValValSer                               590595600                                                                      CTGCTGAACTCCATGAAGTCCATCATCAGCCTGCTCTTCTTGCTCTTC1994                           LeuLeuAsnSerMetLysSerIleIleSerLeuLeuPheLeuLeuPhe                               605610615                                                                      CTGTTCATTGTGGTCTTCGCCCTGCTGGGGATGCAGCTGTTTGGGGGA2042                           LeuPheIleValValPheAlaLeuLeuGlyMetGlnLeuPheGlyGly                               620625630                                                                      CAGTTCAACTTCCAGGATGAGACTCCCACAACCAACTTCGACACCTTC2090                           GlnPheAsnPheGlnAspGluThrProThrThrAsnPheAspThrPhe                               635640645                                                                      CCTGCCGCCATCCTCACTGTCTTCCAGATCCTGACGGGAGAGGACTGG2138                           ProAlaAlaIleLeuThrValPheGlnIleLeuThrGlyGluAspTrp                               650655660665                                                                   AATGCAGTGATGTATCACGGGATCGAATCGCAAGGCGGCGTCAGCAAA2186                           AsnAlaValMetTyrHisGlyIleGluSerGlnGlyGlyValSerLys                               670675680                                                                      GGCATGTTCTCGTCCTTTTACTTCATTGTCCTGACACTGTTCGGAAAC2234                           GlyMetPheSerSerPheTyrPheIleValLeuThrLeuPheGlyAsn                               685690695                                                                      TACACTCTGCTGAATGTCTTTCTGGCCATCGCTGTGGACAACCTGGCC2282                           TyrThrLeuLeuAsnValPheLeuAlaIleAlaValAspAsnLeuAla                               700705710                                                                      AACGCCCAAGAGCTGACCAAGGATGAAGAGGAGATGGAAGAAGCAGCC2330                           AsnAlaGlnGluLeuThrLysAspGluGluGluMetGluGluAlaAla                               715720725                                                                      AATCAGAAGCTTGCTCTGCAAAAGGCCAAAGAAGTGGCTGAAGTCAGC2378                           AsnGlnLysLeuAlaLeuGlnLysAlaLysGluValAlaGluValSer                               730735740745                                                                   CCCATGTCTGCCGCGAACATCTCCATCGCCGCCAGGCAGCAGAACTCG2426                           ProMetSerAlaAlaAsnIleSerIleAlaAlaArgGlnGlnAsnSer                               750755760                                                                      GCCAAGGCGCGCTCGGTGTGGGAGCAGCGGGCCAGCCAGCTACGGCTG2474                           AlaLysAlaArgSerValTrpGluGlnArgAlaSerGlnLeuArgLeu                               765770775                                                                      CAGAACCTGCGGGCCAGCTGCGAGGCGCTGTACAGCGAGATGGACCCC2522                           GlnAsnLeuArgAlaSerCysGluAlaLeuTyrSerGluMetAspPro                               780785790                                                                      GAGGAGCGGCTGCGCTTCGCCACTACGCGCCACCTGCGGCCCGACATG2570                           GluGluArgLeuArgPheAlaThrThrArgHisLeuArgProAspMet                               795800805                                                                      AAGACGCACCTGGACCGGCCGCTGGTGGTGGAGCTGGGCCGCGACGGC2618                           LysThrHisLeuAspArgProLeuValValGluLeuGlyArgAspGly                               810815820825                                                                   GCGCGGGGGCCCGTGGGAGGCAAAGCCCGACCTGAGGCTGCGGAGGCC2666                           AlaArgGlyProValGlyGlyLysAlaArgProGluAlaAlaGluAla                               830835840                                                                      CCCGAGGGCGTCGACCCTCCGCGCAGGCACCACCGGCACCGCGACAAG2714                           ProGluGlyValAspProProArgArgHisHisArgHisArgAspLys                               845850855                                                                      GACAAGACCCCCGCGGCGGGGGACCAGGACCGAGCAGAGGCCCCGAAG2762                           AspLysThrProAlaAlaGlyAspGlnAspArgAlaGluAlaProLys                               860865870                                                                      GCGGAGAGCGGGGAGCCCGGTGCCCGGGAGGAGCGGCCGCGGCCGCAC2810                           AlaGluSerGlyGluProGlyAlaArgGluGluArgProArgProHis                               875880885                                                                      CGCAGCCACAGCAAGGAGGCCGCGGGGCCCCCGGAGGCGCGGAGCGAG2858                           ArgSerHisSerLysGluAlaAlaGlyProProGluAlaArgSerGlu                               890895900905                                                                   CGCGGCCGAGGCCCAGGCCCCGAGGGCGGCCGGCGGCACCACCGGCGC2906                           ArgGlyArgGlyProGlyProGluGlyGlyArgArgHisHisArgArg                               910915920                                                                      GGCTCCCCGGAGGAGGCGGCCGAGCGGGAGCCCCGACGCCACCGCGCG2954                           GlySerProGluGluAlaAlaGluArgGluProArgArgHisArgAla                               925930935                                                                      CACCGGCACCAGGATCCGAGCAAGGAGTGCGCCGGCGCCAAGGGCGAG3002                           HisArgHisGlnAspProSerLysGluCysAlaGlyAlaLysGlyGlu                               940945950                                                                      CGGCGCGCGCGGCACCGCGGCGGCCCCCGAGCGGGGCCCCGGGAGGCG3050                           ArgArgAlaArgHisArgGlyGlyProArgAlaGlyProArgGluAla                               955960965                                                                      GAGAGCGGGGAGGAGCCGGCGCGGCGGCACCGGGCCCGGCACAAGGCG3098                           GluSerGlyGluGluProAlaArgArgHisArgAlaArgHisLysAla                               970975980985                                                                   CAGCCTGCTCACGAGGCTGTGGAGAAGGAGACCACGGAGAAGGAGGCC3146                           GlnProAlaHisGluAlaValGluLysGluThrThrGluLysGluAla                               9909951000                                                                     ACGGAGAAGGAGGCTGAGATAGTGGAAGCCGACAAGGAAAAGGAGCTC3194                           ThrGluLysGluAlaGluIleValGluAlaAspLysGluLysGluLeu                               100510101015                                                                   CGGAACCACCAGCCCCGGGAGCCACACTGTGACCTGGAGACCAGTGGG3242                           ArgAsnHisGlnProArgGluProHisCysAspLeuGluThrSerGly                               102010251030                                                                   ACTGTGACTGTGGGTCCCATGCACACACTGCCCAGCACCTGTCTCCAG3290                           ThrValThrValGlyProMetHisThrLeuProSerThrCysLeuGln                               103510401045                                                                   AAGGTGGAGGAACAGCCAGAGGATGCAGACAATCAGCGGAACGTCACT3338                           LysValGluGluGlnProGluAspAlaAspAsnGlnArgAsnValThr                               1050105510601065                                                               CGCATGGGCAGTCAGCCCCCAGACCCGAACACTATTGTACATATCCCA3386                           ArgMetGlySerGlnProProAspProAsnThrIleValHisIlePro                               107010751080                                                                   GTGATGCTGACGGGCCCTCTTGGGGAAGCCACGGTCGTTCCCAGTGGT3434                           ValMetLeuThrGlyProLeuGlyGluAlaThrValValProSerGly                               108510901095                                                                   AACGTGGACCTGGAAAGCCAAGCAGAGGGGAAGAAGGAGGTGGAAGCG3482                           AsnValAspLeuGluSerGlnAlaGluGlyLysLysGluValGluAla                               110011051110                                                                   GATGACGTGATGAGGAGCGGCCCCCGGCCTATCGTCCCATACAGCTCC3530                           AspAspValMetArgSerGlyProArgProIleValProTyrSerSer                               111511201125                                                                   ATGTTCTGTTTAAGCCCCACCAACCTGCTCCGCCGCTTCTGCCACTAC3578                           MetPheCysLeuSerProThrAsnLeuLeuArgArgPheCysHisTyr                               1130113511401145                                                               ATCGTGACCATGAGGTACTTCGAGGTGGTCATTCTCGTGGTCATCGCC3626                           IleValThrMetArgTyrPheGluValValIleLeuValValIleAla                               115011551160                                                                   TTGAGCAGCATCGCCCTGGCTGCTGAGGACCCAGTGCGCACAGACTCG3674                           LeuSerSerIleAlaLeuAlaAlaGluAspProValArgThrAspSer                               116511701175                                                                   CCCAGGAACAACGCTCTGAAATACCTGGATTACATTTTCACTGGTGTC3722                           ProArgAsnAsnAlaLeuLysTyrLeuAspTyrIlePheThrGlyVal                               118011851190                                                                   TTTACCTTTGAGATGGTGATAAAGATGATCGACTTGGGACTGCTGCTT3770                           PheThrPheGluMetValIleLysMetIleAspLeuGlyLeuLeuLeu                               119512001205                                                                   CACCCTGGAGCCTATTTCCGGGACTTGTGGAACATTCTGGACTTCATT3818                           HisProGlyAlaTyrPheArgAspLeuTrpAsnIleLeuAspPheIle                               1210121512201225                                                               GTGGTCAGTGGCGCCCTGGTGGCGTTTGCTTTCTCAGGATCCAAAGGG3866                           ValValSerGlyAlaLeuValAlaPheAlaPheSerGlySerLysGly                               123012351240                                                                   AAAGACATCAATACCATCAAGTCTCTGAGAGTCCTTCGTGTCCTGCGG3914                           LysAspIleAsnThrIleLysSerLeuArgValLeuArgValLeuArg                               124512501255                                                                   CCCCTCAAGACCATCAAACGGCTGCCCAAGCTCAAGGCTGTGTTTGAC3962                           ProLeuLysThrIleLysArgLeuProLysLeuLysAlaValPheAsp                               126012651270                                                                   TGTGTGGTGAACTCCCTGAAGAATGTCCTCAACATCTTGATTGTCTAC4010                           CysValValAsnSerLeuLysAsnValLeuAsnIleLeuIleValTyr                               127512801285                                                                   ATGCTCTTCATGTTCATATTTGCCGTCATTGCGGTGCAGCTCTTCAAA4058                           MetLeuPheMetPheIlePheAlaValIleAlaValGlnLeuPheLys                               1290129513001305                                                               GGGAAGTTTTTCTACTGCACAGATGAATCCAAGGAGCTGGAGAGGGAC4106                           GlyLysPhePheTyrCysThrAspGluSerLysGluLeuGluArgAsp                               131013151320                                                                   TGCAGGGGTCAGTATTTGGATTATGAGAAGGAGGAAGTGGAAGCTCAG4154                           CysArgGlyGlnTyrLeuAspTyrGluLysGluGluValGluAlaGln                               132513301335                                                                   CCCAGGCAGTGGAAGAAATACGACTTTCACTACGACAATGTGCTCTGG4202                           ProArgGlnTrpLysLysTyrAspPheHisTyrAspAsnValLeuTrp                               134013451350                                                                   GCTCTGCTGACGCTGTTCACAGTGTCCACGGGAGAAGGCTGGCCCATG4250                           AlaLeuLeuThrLeuPheThrValSerThrGlyGluGlyTrpProMet                               135513601365                                                                   GTGCTGAAACACTCCGTGGATGCCACCTATGAGGAGCAGGGTCCAAGC4298                           ValLeuLysHisSerValAspAlaThrTyrGluGluGlnGlyProSer                               1370137513801385                                                               CCTGGGTACCGCATGGAGCTGTCCATCTTCTACGTGGTCTACTTTGTG4346                           ProGlyTyrArgMetGluLeuSerIlePheTyrValValTyrPheVal                               139013951400                                                                   GTCTTTCCCTTCTTCTTCGTCAACATCTTTGTGGCTTTGATCATCATC4394                           ValPheProPhePhePheValAsnIlePheValAlaLeuIleIleIle                               140514101415                                                                   ACCTTCCAGGAGCAGGGGGACAAGGTGATGTCTGAATGCAGCCTGGAG4442                           ThrPheGlnGluGlnGlyAspLysValMetSerGluCysSerLeuGlu                               142014251430                                                                   AAGAACGAGAGGGCTTGCATTGACTTCGCCATCAGCGCCAAACCCCTG4490                           LysAsnGluArgAlaCysIleAspPheAlaIleSerAlaLysProLeu                               143514401445                                                                   ACACGGTACATGCCCCAAAACCGGCAGTCGTTCCAGTATAAGACGTGG4538                           ThrArgTyrMetProGlnAsnArgGlnSerPheGlnTyrLysThrTrp                               1450145514601465                                                               ACATTTGTGGTCTCCCCGCCCTTTGAATACTTCATCATGGCCATGATA4586                           ThrPheValValSerProProPheGluTyrPheIleMetAlaMetIle                               147014751480                                                                   GCCCTCAACACTGTGGTGCTGATGATGAAGTTCTATGATGCACCCTAT4634                           AlaLeuAsnThrValValLeuMetMetLysPheTyrAspAlaProTyr                               148514901495                                                                   GAGTACGAGCTGATGCTGAAATGCCTGAACATCGTGTTCACATCCATG4682                           GluTyrGluLeuMetLeuLysCysLeuAsnIleValPheThrSerMet                               150015051510                                                                   TTCTCCATGGAATGCGTGCTGAAGATCATCGCCTTTGGGGTGCTGAAC4730                           PheSerMetGluCysValLeuLysIleIleAlaPheGlyValLeuAsn                               151515201525                                                                   TATTTCAGAGATGCCTGGAATGTCTTTGACTTTGTCACTGTGTTGGGA4778                           TyrPheArgAspAlaTrpAsnValPheAspPheValThrValLeuGly                               1530153515401545                                                               AGTATTACTGATATTTTAGTAACAGAGATTGCGGAAACGAACAATTTC4826                           SerIleThrAspIleLeuValThrGluIleAlaGluThrAsnAsnPhe                               155015551560                                                                   ATCAACCTCAGCTTCCTCCGCCTCTTTCGAGCTGCGCGGCTGATCAAG4874                           IleAsnLeuSerPheLeuArgLeuPheArgAlaAlaArgLeuIleLys                               156515701575                                                                   CTGCTCCGCCAGGGCTACACCATCCGCATCCTGCTGTGGACCTTTGTC4922                           LeuLeuArgGlnGlyTyrThrIleArgIleLeuLeuTrpThrPheVal                               158015851590                                                                   CAGTCCTTCAAGGCCCTGCCCTACGTGTGTCTGCTCATTGCCATGCTG4970                           GlnSerPheLysAlaLeuProTyrValCysLeuLeuIleAlaMetLeu                               159516001605                                                                   TTCTTCATCTACGCCATCATCGGCATGCAGGTGTTTGGGAATATTGCC5018                           PhePheIleTyrAlaIleIleGlyMetGlnValPheGlyAsnIleAla                               1610161516201625                                                               CTGGATGATGACACCAGCATCAACCGCCACAACAACTTCCGGACGTTT5066                           LeuAspAspAspThrSerIleAsnArgHisAsnAsnPheArgThrPhe                               163016351640                                                                   TTGCAAGCCCTGATGCTGCTGTTCAGGAGCGCCACGGGGGAGGCCTGG5114                           LeuGlnAlaLeuMetLeuLeuPheArgSerAlaThrGlyGluAlaTrp                               164516501655                                                                   CACGAGATCATGCTGTCCTGCCTGAGCAACCAGGCCTGTGATGAGCAG5162                           HisGluIleMetLeuSerCysLeuSerAsnGlnAlaCysAspGluGln                               166016651670                                                                   GCCAATGCCACCGAGTGTGGAAGTGACTTTGCCTACTTCTACTTCGTC5210                           AlaAsnAlaThrGluCysGlySerAspPheAlaTyrPheTyrPheVal                               167516801685                                                                   TCCTTCATCTTCCTGTGCTCCTTTCTGATGTTGAACCTCTTTGTGGCT5258                           SerPheIlePheLeuCysSerPheLeuMetLeuAsnLeuPheValAla                               1690169517001705                                                               GTGATCATGGACAATTTTGAGTACCTCACGCGGGACTCTTCCATCCTA5306                           ValIleMetAspAsnPheGluTyrLeuThrArgAspSerSerIleLeu                               171017151720                                                                   GGTCCTCACCACTTGGATGAGTTCATCCGGGTCTGGGCTGAATACGAC5354                           GlyProHisHisLeuAspGluPheIleArgValTrpAlaGluTyrAsp                               172517301735                                                                   CCGGCTGCGTGTGGGCGCATCAGTTACAATGACATGTTTGAGATGCTG5402                           ProAlaAlaCysGlyArgIleSerTyrAsnAspMetPheGluMetLeu                               174017451750                                                                   AAACACATGTCCCCGCCTCTGGGGCTGGGGAAGAAATGCCCTGCTCGA5450                           LysHisMetSerProProLeuGlyLeuGlyLysLysCysProAlaArg                               175517601765                                                                   GTTGCTTACAAGCGCCTGGTTCGCATGAACATGCCCATCTCCAACGAG5498                           ValAlaTyrLysArgLeuValArgMetAsnMetProIleSerAsnGlu                               1770177517801785                                                               GACATGACTGTTCACTTCACGTCCACGCTGATGGCCCTCATCCGGACG5546                           AspMetThrValHisPheThrSerThrLeuMetAlaLeuIleArgThr                               179017951800                                                                   GCACTGGAGATCAAGCTGGCCCCAGCTGGGACAAAGCAGCATCAGTGT5594                           AlaLeuGluIleLysLeuAlaProAlaGlyThrLysGlnHisGlnCys                               180518101815                                                                   GACGCGGAGTTGAGGAAGGAGATTTCCGTTGTGTGGGCCAATCTGCCC5642                           AspAlaGluLeuArgLysGluIleSerValValTrpAlaAsnLeuPro                               182018251830                                                                   CAGAAGACTTTGGACTTGCTGGTACCACCCCATAAGCCTGATGAGATG5690                           GlnLysThrLeuAspLeuLeuValProProHisLysProAspGluMet                               183518401845                                                                   ACAGTGGGGAAGGTTTATGCAGCTCTGATGATATTTGACTTCTACAAG5738                           ThrValGlyLysValTyrAlaAlaLeuMetIlePheAspPheTyrLys                               1850185518601865                                                               CAGAACAAAACCACCAGAGACCAGATGCAGCAGGCTCCTGGAGGCCTC5786                           GlnAsnLysThrThrArgAspGlnMetGlnGlnAlaProGlyGlyLeu                               187018751880                                                                   TCCCAGATGGGTCCTGTGTCCCTGTTCCACCCTCTGAAGGCCACCCTG5834                           SerGlnMetGlyProValSerLeuPheHisProLeuLysAlaThrLeu                               188518901895                                                                   GAGCAGACACAGCCGGCTGTGCTCCGAGGAGCCCGGGTTTTCCTTCGA5882                           GluGlnThrGlnProAlaValLeuArgGlyAlaArgValPheLeuArg                               190019051910                                                                   CAGAAGAGTTCCACCTCCCTCAGCAATGGCGGGGCCATACAAAACCAA5930                           GlnLysSerSerThrSerLeuSerAsnGlyGlyAlaIleGlnAsnGln                               191519201925                                                                   GAGAGTGGCATCAAAGAGTCTGTCTCCTGGGGCACTCAAAGGACCCAG5978                           GluSerGlyIleLysGluSerValSerTrpGlyThrGlnArgThrGln                               1930193519401945                                                               GATGCACCCCATGAGGCCAGGCCACCCCTGGAGCGTGGCCACTCCACA6026                           AspAlaProHisGluAlaArgProProLeuGluArgGlyHisSerThr                               195019551960                                                                   GAGATCCCTGTGGGGCGGTCAGGAGCACTGGCTGTGGACGTTCAGATG6074                           GluIleProValGlyArgSerGlyAlaLeuAlaValAspValGlnMet                               196519701975                                                                   CAGAGCATAACCCGGAGGGGCCCTGATGGGGAGCCCCAGCCTGGGCTG6122                           GlnSerIleThrArgArgGlyProAspGlyGluProGlnProGlyLeu                               198019851990                                                                   GAGAGCCAGGGTCGAGCGGCCTCCATGCCCCGCCTTGCGGCCGAGACT6170                           GluSerGlnGlyArgAlaAlaSerMetProArgLeuAlaAlaGluThr                               199520002005                                                                   CAGCCCGTCACAGATGCCAGCCCCATGAAGCGCTCCATCTCCACGCTG6218                           GlnProValThrAspAlaSerProMetLysArgSerIleSerThrLeu                               2010201520202025                                                               GCCCAGCGGCCCCGTGGGACTCATCTTTGCAGCACCACCCCGGACCGC6266                           AlaGlnArgProArgGlyThrHisLeuCysSerThrThrProAspArg                               203020352040                                                                   CCACCCCCTAGCCAGGCGTCGTCGCACCACCACCACCACCGCTGCCAC6314                           ProProProSerGlnAlaSerSerHisHisHisHisHisArgCysHis                               204520502055                                                                   CGCCGCAGGGACAGGAAGCAGAGGTCCCTGGAGAAGGGGCCCAGCCTG6362                           ArgArgArgAspArgLysGlnArgSerLeuGluLysGlyProSerLeu                               206020652070                                                                   TCTGCCGATATGGATGGCGCACCAAGCAGTGCTGTGGGGCCGGGGCTG6410                           SerAlaAspMetAspGlyAlaProSerSerAlaValGlyProGlyLeu                               207520802085                                                                   CCCCCGGGAGAGGGGCCTACAGGCTGCCGGCGGGAACGAGAGCGCCGG6458                           ProProGlyGluGlyProThrGlyCysArgArgGluArgGluArgArg                               2090209521002105                                                               CAGGAGCGGGGCCGGTCCCAGGAGCGGAGGCAGCCCTCATCCTCCTCC6506                           GlnGluArgGlyArgSerGlnGluArgArgGlnProSerSerSerSer                               211021152120                                                                   TCGGAGAAGCAGCGCTTCTACTCCTGCGACCGCTTTGGGGGCCGTGAG6554                           SerGluLysGlnArgPheTyrSerCysAspArgPheGlyGlyArgGlu                               212521302135                                                                   CCCCCGAAGCCCAAGCCCTCCCTCAGCAGCCACCCAACGTCGCCAACA6602                           ProProLysProLysProSerLeuSerSerHisProThrSerProThr                               214021452150                                                                   GCTGGCCAGGAGCCGGGACCCCACCCACAGGGCAGTGGTTCCGTGAAT6650                           AlaGlyGlnGluProGlyProHisProGlnGlySerGlySerValAsn                               215521602165                                                                   GGGAGCCCCTTGCTGTCAACATCTGGTGCTAGCACCCCCGGCCGCGGT6698                           GlySerProLeuLeuSerThrSerGlyAlaSerThrProGlyArgGly                               2170217521802185                                                               GGGCGGAGGCAGCTCCCCCAGACGCCCCTGACTCCCCGCCCCAGCATC6746                           GlyArgArgGlnLeuProGlnThrProLeuThrProArgProSerIle                               219021952200                                                                   ACCTACAAGACGGCCAACTCCTCACCCATCCACTTCGCCGGGGCTCAG6794                           ThrTyrLysThrAlaAsnSerSerProIleHisPheAlaGlyAlaGln                               220522102215                                                                   ACCAGCCTCCCTGCCTTCTCCCCAGGCCGGCTCAGCCGTGGGCTTTCC6842                           ThrSerLeuProAlaPheSerProGlyArgLeuSerArgGlyLeuSer                               222022252230                                                                   GAACACAACGCCCTGCTGCAGAGAGACCCCCTCAGCCAGCCCCTGGCC6890                           GluHisAsnAlaLeuLeuGlnArgAspProLeuSerGlnProLeuAla                               223522402245                                                                   CCTGGCTCTCGAATTGGCTCTGACCCTTACCTGGGGCAGCGTCTGGAC6938                           ProGlySerArgIleGlySerAspProTyrLeuGlyGlnArgLeuAsp                               2250225522602265                                                               AGTGAGGCCTCTGTCCACGCCCTGCCTGAGGACACGCTCACTTTCGAG6986                           SerGluAlaSerValHisAlaLeuProGluAspThrLeuThrPheGlu                               227022752280                                                                   GAGGCTGTGGCCACCAACTCGGGCCGCTCCTCCAGGACTTCCTACGTG7034                           GluAlaValAlaThrAsnSerGlyArgSerSerArgThrSerTyrVal                               228522902295                                                                   TCCTCCCTGACCTCCCAGTCTCACCCTCTCCGCCGCGTGCCCAACGGT7082                           SerSerLeuThrSerGlnSerHisProLeuArgArgValProAsnGly                               230023052310                                                                   TACCACTGCACCCTGGGACTCAGCTCGGGTGGCCGAGCACGGCACAGC7130                           TyrHisCysThrLeuGlyLeuSerSerGlyGlyArgAlaArgHisSer                               231523202325                                                                   TACCACCACCCTGACCAAGACCACTGGTGCTAGCTGCACCGTGACCGCTC7180                         TyrHisHisProAspGlnAspHisTrpCys                                                 23302335                                                                       AGACGCCTGCATGCAGCAGGCGTGTGTTCCAGTGGATGAGTTTTATCATCCACACGGGGC7240               AGTCGGCCCTCGGGGGAGGCCTTGCCCACCTTGGTGAGGCTCCTGTGGCCCCTCCCTCCC7300               CCTCCTCCCCTCTTTTACTCTAGACGACGAATAAAGCCCTGTTGCTTGAGTGTACGTACC7360               GC7362                                                                         (2) INFORMATION FOR SEQ ID NO:8:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 7175 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 144..6857                                                        (ix) FEATURE:                                                                  (A) NAME/KEY: 5'UTR                                                            (B) LOCATION: 1..143                                                           (ix) FEATURE:                                                                  (A) NAME/KEY: 3'UTR                                                            (B) LOCATION: 6855..7175                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:                                        GCGGCGGCGGCTGCGGCGGTGGGGCCGGGCGAGGTCCGTGCGGTCCCGGCGGCTCCGTGG60                 CTGCTCCGCTCTGAGCGCCTGCGCGCCCCGCGCCCTCCCTGCCGGGGCCGCTGGGCCGGG120                GATGCACGCGGGGCCCGGGAGCCATGGTCCGCTTCGGGGACGAGCTGGGC170                          MetValArgPheGlyAspGluLeuGly                                                    15                                                                             GGCCGCTATGGAGGCCCCGGCGGCGGAGAGCGGGCCCGGGGCGGCGGG218                            GlyArgTyrGlyGlyProGlyGlyGlyGluArgAlaArgGlyGlyGly                               10152025                                                                       GCCGGCGGGGCGGGGGGCCCGGGTCCCGGGGGGCTGCAGCCCGGCCAG266                            AlaGlyGlyAlaGlyGlyProGlyProGlyGlyLeuGlnProGlyGln                               303540                                                                         CGGGTCCTCTACAAGCAATCGATCGCGCAGCGCGCGCGGACCATGGCG314                            ArgValLeuTyrLysGlnSerIleAlaGlnArgAlaArgThrMetAla                               455055                                                                         CTGTACAACCCCATCCCGGTCAAGCAGAACTGCTTCACCGTCAACCGC362                            LeuTyrAsnProIleProValLysGlnAsnCysPheThrValAsnArg                               606570                                                                         TCGCTCTTCGTCTTCAGCGAGGACAACGTCGTCCGCAAATACGCGAAG410                            SerLeuPheValPheSerGluAspAsnValValArgLysTyrAlaLys                               758085                                                                         CGCATCACCGAGTGGCCTCCATTCGAGAATATGATCCTGGCCACCATC458                            ArgIleThrGluTrpProProPheGluAsnMetIleLeuAlaThrIle                               9095100105                                                                     ATCGCCAACTGCATCGTGCTGGCCCTGGAGCAGCACCTCCCTGATGGG506                            IleAlaAsnCysIleValLeuAlaLeuGluGlnHisLeuProAspGly                               110115120                                                                      GACAAAACGCCCATGTCCGAGCGGCTGGACGACACGGAGCCCTATTTC554                            AspLysThrProMetSerGluArgLeuAspAspThrGluProTyrPhe                               125130135                                                                      ATCGGGATCTTTTGCTTCGAGGCAGGGATCAAAATCATCGCTCTGGGC602                            IleGlyIlePheCysPheGluAlaGlyIleLysIleIleAlaLeuGly                               140145150                                                                      TTTGTCTTCCACAAGGGCTCTTACCTGCGGAACGGCTGGAACGTCATG650                            PheValPheHisLysGlySerTyrLeuArgAsnGlyTrpAsnValMet                               155160165                                                                      GACTTCGTGGTCGTCCTCACAGGGATCCTTGCCACGGCTGGAACTGAC698                            AspPheValValValLeuThrGlyIleLeuAlaThrAlaGlyThrAsp                               170175180185                                                                   TTCGACCTGCGAACACTGAGGGCTGTGCGTGTGCTGAGGCCCCTGAAG746                            PheAspLeuArgThrLeuArgAlaValArgValLeuArgProLeuLys                               190195200                                                                      CTGGTGTCTGGGATTCCAAGTTTGCAGGTGGTGCTCAAGTCCATCATG794                            LeuValSerGlyIleProSerLeuGlnValValLeuLysSerIleMet                               205210215                                                                      AAGGCCATGGTTCCACTCCTGCAGATTGGGCTGCTTCTCTTCTTTGCC842                            LysAlaMetValProLeuLeuGlnIleGlyLeuLeuLeuPhePheAla                               220225230                                                                      ATCCTCATGTTTGCCATCATTGGCCTGGAGTTCTACATGGGCAAGTTC890                            IleLeuMetPheAlaIleIleGlyLeuGluPheTyrMetGlyLysPhe                               235240245                                                                      CACAAGGCCTGTTTCCCCAACAGCACAGATGCGGAGCCCGTGGGTGAC938                            HisLysAlaCysPheProAsnSerThrAspAlaGluProValGlyAsp                               250255260265                                                                   TTCCCCTGTGGCAAGGAGGCCCCAGCCCGGCTGTGCGAGGGCGACACT986                            PheProCysGlyLysGluAlaProAlaArgLeuCysGluGlyAspThr                               270275280                                                                      GAGTGCCGGGAGTACTGGCCAGGACCCAACTTTGGCATCACCAACTTT1034                           GluCysArgGluTyrTrpProGlyProAsnPheGlyIleThrAsnPhe                               285290295                                                                      GACAATATCCTGTTTGCCATCTTGACGGTGTTCCAGTGCATCACCATG1082                           AspAsnIleLeuPheAlaIleLeuThrValPheGlnCysIleThrMet                               300305310                                                                      GAGGGCTGGACTGACATCCTCTATAATACAAACGATGCGGCCGGCAAC1130                           GluGlyTrpThrAspIleLeuTyrAsnThrAsnAspAlaAlaGlyAsn                               315320325                                                                      ACCTGGAACTGGCTCTACTTCATCCCTCTCATCATCATCGGCTCCTTC1178                           ThrTrpAsnTrpLeuTyrPheIleProLeuIleIleIleGlySerPhe                               330335340345                                                                   TTCATGCTCAACCTGGTGCTGGGCGTGCTCTCGGGGGAGTTTGCCAAG1226                           PheMetLeuAsnLeuValLeuGlyValLeuSerGlyGluPheAlaLys                               350355360                                                                      GAGCGAGAGAGGGTGGAGAACCGCCGCGCCTTCCTGAAGCTGCGCCGG1274                           GluArgGluArgValGluAsnArgArgAlaPheLeuLysLeuArgArg                               365370375                                                                      CAGCAGCAGATCGAGCGAGAGCTCAACGGGTACCTGGAGTGGATCTTC1322                           GlnGlnGlnIleGluArgGluLeuAsnGlyTyrLeuGluTrpIlePhe                               380385390                                                                      AAGGCGGAGGAAGTCATGCTGGCCGAGGAGGACAGGAATGCAGAGGAG1370                           LysAlaGluGluValMetLeuAlaGluGluAspArgAsnAlaGluGlu                               395400405                                                                      AAGTCCCCTTTGGACGTGCTGAAGAGAGCGGCCACCAAGAAGAGCAGA1418                           LysSerProLeuAspValLeuLysArgAlaAlaThrLysLysSerArg                               410415420425                                                                   AATGACCTGATCCACGCAGAGGAGGGAGAGGACCGGTTTGCAGATCTC1466                           AsnAspLeuIleHisAlaGluGluGlyGluAspArgPheAlaAspLeu                               430435440                                                                      TGTGCTGTTGGATCCCCCTTCGCCCGCGCCAGCCTCAAGAGCGGGAAG1514                           CysAlaValGlySerProPheAlaArgAlaSerLeuLysSerGlyLys                               445450455                                                                      ACAGAGAGCTCGTCATACTTCCGGAGGAAGGAGAAGATGTTCCGGTTT1562                           ThrGluSerSerSerTyrPheArgArgLysGluLysMetPheArgPhe                               460465470                                                                      TTTATCCGGCGCATGGTGAAGGCTCAGAGCTTCTACTGGGTGGTGCTG1610                           PheIleArgArgMetValLysAlaGlnSerPheTyrTrpValValLeu                               475480485                                                                      TGCGTGGTGGCCCTGAACACACTGTGTGTGGCCATGGTGCATTACAAC1658                           CysValValAlaLeuAsnThrLeuCysValAlaMetValHisTyrAsn                               490495500505                                                                   CAGCCGCGGCGGCTTACCACGACCCTGTATTTTGCAGAGTTTGTTTTC1706                           GlnProArgArgLeuThrThrThrLeuTyrPheAlaGluPheValPhe                               510515520                                                                      CTGGGTCTCTTCCTCACAGAGATGTCCCTGAAGATGTATGGCCTGGGG1754                           LeuGlyLeuPheLeuThrGluMetSerLeuLysMetTyrGlyLeuGly                               525530535                                                                      CCCAGAAGCTACTTCCGGTCCTCCTTCAACTGCTTCGACTTTGGGGTC1802                           ProArgSerTyrPheArgSerSerPheAsnCysPheAspPheGlyVal                               540545550                                                                      ATCGTGGGGAGCGTCTTTGAAGTGGTCTGGGCGGCCATCAAGCCGGGA1850                           IleValGlySerValPheGluValValTrpAlaAlaIleLysProGly                               555560565                                                                      AGCTCCTTTGGGATCAGTGTGCTGCGGGCCCTCCGCCTGCTGAGGATC1898                           SerSerPheGlyIleSerValLeuArgAlaLeuArgLeuLeuArgIle                               570575580585                                                                   TTCAAAGTCACGAAGTACTGGAGCTCCCTGCGGAACCTGGTGGTGTCC1946                           PheLysValThrLysTyrTrpSerSerLeuArgAsnLeuValValSer                               590595600                                                                      CTGCTGAACTCCATGAAGTCCATCATCAGCCTGCTCTTCTTGCTCTTC1994                           LeuLeuAsnSerMetLysSerIleIleSerLeuLeuPheLeuLeuPhe                               605610615                                                                      CTGTTCATTGTGGTCTTCGCCCTGCTGGGGATGCAGCTGTTTGGGGGA2042                           LeuPheIleValValPheAlaLeuLeuGlyMetGlnLeuPheGlyGly                               620625630                                                                      CAGTTCAACTTCCAGGATGAGACTCCCACAACCAACTTCGACACCTTC2090                           GlnPheAsnPheGlnAspGluThrProThrThrAsnPheAspThrPhe                               635640645                                                                      CCTGCCGCCATCCTCACTGTCTTCCAGATCCTGACGGGAGAGGACTGG2138                           ProAlaAlaIleLeuThrValPheGlnIleLeuThrGlyGluAspTrp                               650655660665                                                                   AATGCAGTGATGTATCACGGGATCGAATCGCAAGGCGGCGTCAGCAAA2186                           AsnAlaValMetTyrHisGlyIleGluSerGlnGlyGlyValSerLys                               670675680                                                                      GGCATGTTCTCGTCCTTTTACTTCATTGTCCTGACACTGTTCGGAAAC2234                           GlyMetPheSerSerPheTyrPheIleValLeuThrLeuPheGlyAsn                               685690695                                                                      TACACTCTGCTGAATGTCTTTCTGGCCATCGCTGTGGACAACCTGGCC2282                           TyrThrLeuLeuAsnValPheLeuAlaIleAlaValAspAsnLeuAla                               700705710                                                                      AACGCCCAAGAGCTGACCAAGGATGAAGAGGAGATGGAAGAAGCAGCC2330                           AsnAlaGlnGluLeuThrLysAspGluGluGluMetGluGluAlaAla                               715720725                                                                      AATCAGAAGCTTGCTCTGCAAAAGGCCAAAGAAGTGGCTGAAGTCAGC2378                           AsnGlnLysLeuAlaLeuGlnLysAlaLysGluValAlaGluValSer                               730735740745                                                                   CCCATGTCTGCCGCGAACATCTCCATCGCCGCCAGGCAGCAGAACTCG2426                           ProMetSerAlaAlaAsnIleSerIleAlaAlaArgGlnGlnAsnSer                               750755760                                                                      GCCAAGGCGCGCTCGGTGTGGGAGCAGCGGGCCAGCCAGCTACGGCTG2474                           AlaLysAlaArgSerValTrpGluGlnArgAlaSerGlnLeuArgLeu                               765770775                                                                      CAGAACCTGCGGGCCAGCTGCGAGGCGCTGTACAGCGAGATGGACCCC2522                           GlnAsnLeuArgAlaSerCysGluAlaLeuTyrSerGluMetAspPro                               780785790                                                                      GAGGAGCGGCTGCGCTTCGCCACTACGCGCCACCTGCGGCCCGACATG2570                           GluGluArgLeuArgPheAlaThrThrArgHisLeuArgProAspMet                               795800805                                                                      AAGACGCACCTGGACCGGCCGCTGGTGGTGGAGCTGGGCCGCGACGGC2618                           LysThrHisLeuAspArgProLeuValValGluLeuGlyArgAspGly                               810815820825                                                                   GCGCGGGGGCCCGTGGGAGGCAAAGCCCGACCTGAGGCTGCGGAGGCC2666                           AlaArgGlyProValGlyGlyLysAlaArgProGluAlaAlaGluAla                               830835840                                                                      CCCGAGGGCGTCGACCCTCCGCGCAGGCACCACCGGCACCGCGACAAG2714                           ProGluGlyValAspProProArgArgHisHisArgHisArgAspLys                               845850855                                                                      GACAAGACCCCCGCGGCGGGGGACCAGGACCGAGCAGAGGCCCCGAAG2762                           AspLysThrProAlaAlaGlyAspGlnAspArgAlaGluAlaProLys                               860865870                                                                      GCGGAGAGCGGGGAGCCCGGTGCCCGGGAGGAGCGGCCGCGGCCGCAC2810                           AlaGluSerGlyGluProGlyAlaArgGluGluArgProArgProHis                               875880885                                                                      CGCAGCCACAGCAAGGAGGCCGCGGGGCCCCCGGAGGCGCGGAGCGAG2858                           ArgSerHisSerLysGluAlaAlaGlyProProGluAlaArgSerGlu                               890895900905                                                                   CGCGGCCGAGGCCCAGGCCCCGAGGGCGGCCGGCGGCACCACCGGCGC2906                           ArgGlyArgGlyProGlyProGluGlyGlyArgArgHisHisArgArg                               910915920                                                                      GGCTCCCCGGAGGAGGCGGCCGAGCGGGAGCCCCGACGCCACCGCGCG2954                           GlySerProGluGluAlaAlaGluArgGluProArgArgHisArgAla                               925930935                                                                      CACCGGCACCAGGATCCGAGCAAGGAGTGCGCCGGCGCCAAGGGCGAG3002                           HisArgHisGlnAspProSerLysGluCysAlaGlyAlaLysGlyGlu                               940945950                                                                      CGGCGCGCGCGGCACCGCGGCGGCCCCCGAGCGGGGCCCCGGGAGGCG3050                           ArgArgAlaArgHisArgGlyGlyProArgAlaGlyProArgGluAla                               955960965                                                                      GAGAGCGGGGAGGAGCCGGCGCGGCGGCACCGGGCCCGGCACAAGGCG3098                           GluSerGlyGluGluProAlaArgArgHisArgAlaArgHisLysAla                               970975980985                                                                   CAGCCTGCTCACGAGGCTGTGGAGAAGGAGACCACGGAGAAGGAGGCC3146                           GlnProAlaHisGluAlaValGluLysGluThrThrGluLysGluAla                               9909951000                                                                     ACGGAGAAGGAGGCTGAGATAGTGGAAGCCGACAAGGAAAAGGAGCTC3194                           ThrGluLysGluAlaGluIleValGluAlaAspLysGluLysGluLeu                               100510101015                                                                   CGGAACCACCAGCCCCGGGAGCCACACTGTGACCTGGAGACCAGTGGG3242                           ArgAsnHisGlnProArgGluProHisCysAspLeuGluThrSerGly                               102010251030                                                                   ACTGTGACTGTGGGTCCCATGCACACACTGCCCAGCACCTGTCTCCAG3290                           ThrValThrValGlyProMetHisThrLeuProSerThrCysLeuGln                               103510401045                                                                   AAGGTGGAGGAACAGCCAGAGGATGCAGACAATCAGCGGAACGTCACT3338                           LysValGluGluGlnProGluAspAlaAspAsnGlnArgAsnValThr                               1050105510601065                                                               CGCATGGGCAGTCAGCCCCCAGACCCGAACACTATTGTACATATCCCA3386                           ArgMetGlySerGlnProProAspProAsnThrIleValHisIlePro                               107010751080                                                                   GTGATGCTGACGGGCCCTCTTGGGGAAGCCACGGTCGTTCCCAGTGGT3434                           ValMetLeuThrGlyProLeuGlyGluAlaThrValValProSerGly                               108510901095                                                                   AACGTGGACCTGGAAAGCCAAGCAGAGGGGAAGAAGGAGGTGGAAGCG3482                           AsnValAspLeuGluSerGlnAlaGluGlyLysLysGluValGluAla                               110011051110                                                                   GATGACGTGATGAGGAGCGGCCCCCGGCCTATCGTCCCATACAGCTCC3530                           AspAspValMetArgSerGlyProArgProIleValProTyrSerSer                               111511201125                                                                   ATGTTCTGTTTAAGCCCCACCAACCTGCTCCGCCGCTTCTGCCACTAC3578                           MetPheCysLeuSerProThrAsnLeuLeuArgArgPheCysHisTyr                               1130113511401145                                                               ATCGTGACCATGAGGTACTTCGAGGTGGTCATTCTCGTGGTCATCGCC3626                           IleValThrMetArgTyrPheGluValValIleLeuValValIleAla                               115011551160                                                                   TTGAGCAGCATCGCCCTGGCTGCTGAGGACCCAGTGCGCACAGACTCG3674                           LeuSerSerIleAlaLeuAlaAlaGluAspProValArgThrAspSer                               116511701175                                                                   CCCAGGAACAACGCTCTGAAATACCTGGATTACATTTTCACTGGTGTC3722                           ProArgAsnAsnAlaLeuLysTyrLeuAspTyrIlePheThrGlyVal                               118011851190                                                                   TTTACCTTTGAGATGGTGATAAAGATGATCGACTTGGGACTGCTGCTT3770                           PheThrPheGluMetValIleLysMetIleAspLeuGlyLeuLeuLeu                               119512001205                                                                   CACCCTGGAGCCTATTTCCGGGACTTGTGGAACATTCTGGACTTCATT3818                           HisProGlyAlaTyrPheArgAspLeuTrpAsnIleLeuAspPheIle                               1210121512201225                                                               GTGGTCAGTGGCGCCCTGGTGGCGTTTGCTTTCTCAGGATCCAAAGGG3866                           ValValSerGlyAlaLeuValAlaPheAlaPheSerGlySerLysGly                               123012351240                                                                   AAAGACATCAATACCATCAAGTCTCTGAGAGTCCTTCGTGTCCTGCGG3914                           LysAspIleAsnThrIleLysSerLeuArgValLeuArgValLeuArg                               124512501255                                                                   CCCCTCAAGACCATCAAACGGCTGCCCAAGCTCAAGGCTGTGTTTGAC3962                           ProLeuLysThrIleLysArgLeuProLysLeuLysAlaValPheAsp                               126012651270                                                                   TGTGTGGTGAACTCCCTGAAGAATGTCCTCAACATCTTGATTGTCTAC4010                           CysValValAsnSerLeuLysAsnValLeuAsnIleLeuIleValTyr                               127512801285                                                                   ATGCTCTTCATGTTCATATTTGCCGTCATTGCGGTGCAGCTCTTCAAA4058                           MetLeuPheMetPheIlePheAlaValIleAlaValGlnLeuPheLys                               1290129513001305                                                               GGGAAGTTTTTCTACTGCACAGATGAATCCAAGGAGCTGGAGAGGGAC4106                           GlyLysPhePheTyrCysThrAspGluSerLysGluLeuGluArgAsp                               131013151320                                                                   TGCAGGGGTCAGTATTTGGATTATGAGAAGGAGGAAGTGGAAGCTCAG4154                           CysArgGlyGlnTyrLeuAspTyrGluLysGluGluValGluAlaGln                               132513301335                                                                   CCCAGGCAGTGGAAGAAATACGACTTTCACTACGACAATGTGCTCTGG4202                           ProArgGlnTrpLysLysTyrAspPheHisTyrAspAsnValLeuTrp                               134013451350                                                                   GCTCTGCTGACGCTGTTCACAGTGTCCACGGGAGAAGGCTGGCCCATG4250                           AlaLeuLeuThrLeuPheThrValSerThrGlyGluGlyTrpProMet                               135513601365                                                                   GTGCTGAAACACTCCGTGGATGCCACCTATGAGGAGCAGGGTCCAAGC4298                           ValLeuLysHisSerValAspAlaThrTyrGluGluGlnGlyProSer                               1370137513801385                                                               CCTGGGTACCGCATGGAGCTGTCCATCTTCTACGTGGTCTACTTTGTG4346                           ProGlyTyrArgMetGluLeuSerIlePheTyrValValTyrPheVal                               139013951400                                                                   GTCTTTCCCTTCTTCTTCGTCAACATCTTTGTGGCTTTGATCATCATC4394                           ValPheProPhePhePheValAsnIlePheValAlaLeuIleIleIle                               140514101415                                                                   ACCTTCCAGGAGCAGGGGGACAAGGTGATGTCTGAATGCAGCCTGGAG4442                           ThrPheGlnGluGlnGlyAspLysValMetSerGluCysSerLeuGlu                               142014251430                                                                   AAGAACGAGAGGGCTTGCATTGACTTCGCCATCAGCGCCAAACCCCTG4490                           LysAsnGluArgAlaCysIleAspPheAlaIleSerAlaLysProLeu                               143514401445                                                                   ACACGGTACATGCCCCAAAACCGGCAGTCGTTCCAGTATAAGACGTGG4538                           ThrArgTyrMetProGlnAsnArgGlnSerPheGlnTyrLysThrTrp                               1450145514601465                                                               ACATTTGTGGTCTCCCCGCCCTTTGAATACTTCATCATGGCCATGATA4586                           ThrPheValValSerProProPheGluTyrPheIleMetAlaMetIle                               147014751480                                                                   GCCCTCAACACTGTGGTGCTGATGATGAAGTTCTATGATGCACCCTAT4634                           AlaLeuAsnThrValValLeuMetMetLysPheTyrAspAlaProTyr                               148514901495                                                                   GAGTACGAGCTGATGCTGAAATGCCTGAACATCGTGTTCACATCCATG4682                           GluTyrGluLeuMetLeuLysCysLeuAsnIleValPheThrSerMet                               150015051510                                                                   TTCTCCATGGAATGCGTGCTGAAGATCATCGCCTTTGGGGTGCTGAAC4730                           PheSerMetGluCysValLeuLysIleIleAlaPheGlyValLeuAsn                               151515201525                                                                   TATTTCAGAGATGCCTGGAATGTCTTTGACTTTGTCACTGTGTTGGGA4778                           TyrPheArgAspAlaTrpAsnValPheAspPheValThrValLeuGly                               1530153515401545                                                               AGTATTACTGATATTTTAGTAACAGAGATTGCGGAAACGAACAATTTC4826                           SerIleThrAspIleLeuValThrGluIleAlaGluThrAsnAsnPhe                               155015551560                                                                   ATCAACCTCAGCTTCCTCCGCCTCTTTCGAGCTGCGCGGCTGATCAAG4874                           IleAsnLeuSerPheLeuArgLeuPheArgAlaAlaArgLeuIleLys                               156515701575                                                                   CTGCTCCGCCAGGGCTACACCATCCGCATCCTGCTGTGGACCTTTGTC4922                           LeuLeuArgGlnGlyTyrThrIleArgIleLeuLeuTrpThrPheVal                               158015851590                                                                   CAGTCCTTCAAGGCCCTGCCCTACGTGTGTCTGCTCATTGCCATGCTG4970                           GlnSerPheLysAlaLeuProTyrValCysLeuLeuIleAlaMetLeu                               159516001605                                                                   TTCTTCATCTACGCCATCATCGGCATGCAGGTGTTTGGGAATATTGCC5018                           PhePheIleTyrAlaIleIleGlyMetGlnValPheGlyAsnIleAla                               1610161516201625                                                               CTGGATGATGACACCAGCATCAACCGCCACAACAACTTCCGGACGTTT5066                           LeuAspAspAspThrSerIleAsnArgHisAsnAsnPheArgThrPhe                               163016351640                                                                   TTGCAAGCCCTGATGCTGCTGTTCAGGAGCGCCACGGGGGAGGCCTGG5114                           LeuGlnAlaLeuMetLeuLeuPheArgSerAlaThrGlyGluAlaTrp                               164516501655                                                                   CACGAGATCATGCTGTCCTGCCTGAGCAACCAGGCCTGTGATGAGCAG5162                           HisGluIleMetLeuSerCysLeuSerAsnGlnAlaCysAspGluGln                               166016651670                                                                   GCCAATGCCACCGAGTGTGGAAGTGACTTTGCCTACTTCTACTTCGTC5210                           AlaAsnAlaThrGluCysGlySerAspPheAlaTyrPheTyrPheVal                               167516801685                                                                   TCCTTCATCTTCCTGTGCTCCTTTCTGATGTTGAACCTCTTTGTGGCT5258                           SerPheIlePheLeuCysSerPheLeuMetLeuAsnLeuPheValAla                               1690169517001705                                                               GTGATCATGGACAATTTTGAGTACCTCACGCGGGACTCTTCCATCCTA5306                           ValIleMetAspAsnPheGluTyrLeuThrArgAspSerSerIleLeu                               171017151720                                                                   GGTCCTCACCACTTGGATGAGTTCATCCGGGTCTGGGCTGAATACGAC5354                           GlyProHisHisLeuAspGluPheIleArgValTrpAlaGluTyrAsp                               172517301735                                                                   CCGGCTGCGTGTGGGCGCATCAGTTACAATGACATGTTTGAGATGCTG5402                           ProAlaAlaCysGlyArgIleSerTyrAsnAspMetPheGluMetLeu                               174017451750                                                                   AAACACATGTCCCCGCCTCTGGGGCTGGGGAAGAAATGCCCTGCTCGA5450                           LysHisMetSerProProLeuGlyLeuGlyLysLysCysProAlaArg                               175517601765                                                                   GTTGCTTACAAGCGCCTGGTTCGCATGAACATGCCCATCTCCAACGAG5498                           ValAlaTyrLysArgLeuValArgMetAsnMetProIleSerAsnGlu                               1770177517801785                                                               GACATGACTGTTCACTTCACGTCCACGCTGATGGCCCTCATCCGGACG5546                           AspMetThrValHisPheThrSerThrLeuMetAlaLeuIleArgThr                               179017951800                                                                   GCACTGGAGATCAAGCTGGCCCCAGCTGGGACAAAGCAGCATCAGTGT5594                           AlaLeuGluIleLysLeuAlaProAlaGlyThrLysGlnHisGlnCys                               180518101815                                                                   GACGCGGAGTTGAGGAAGGAGATTTCCGTTGTGTGGGCCAATCTGCCC5642                           AspAlaGluLeuArgLysGluIleSerValValTrpAlaAsnLeuPro                               182018251830                                                                   CAGAAGACTTTGGACTTGCTGGTACCACCCCATAAGCCTGATGAGATG5690                           GlnLysThrLeuAspLeuLeuValProProHisLysProAspGluMet                               183518401845                                                                   ACAGTGGGGAAGGTTTATGCAGCTCTGATGATATTTGACTTCTACAAG5738                           ThrValGlyLysValTyrAlaAlaLeuMetIlePheAspPheTyrLys                               1850185518601865                                                               CAGAACAAAACCACCAGAGACCAGATGCAGCAGGCTCCTGGAGGCCTC5786                           GlnAsnLysThrThrArgAspGlnMetGlnGlnAlaProGlyGlyLeu                               187018751880                                                                   TCCCAGATGGGTCCTGTGTCCCTGTTCCACCCTCTGAAGGCCACCCTG5834                           SerGlnMetGlyProValSerLeuPheHisProLeuLysAlaThrLeu                               188518901895                                                                   GAGCAGACACAGCCGGCTGTGCTCCGAGGAGCCCGGGTTTTCCTTCGA5882                           GluGlnThrGlnProAlaValLeuArgGlyAlaArgValPheLeuArg                               190019051910                                                                   CAGAAGAGTTCCACCTCCCTCAGCAATGGCGGGGCCATACAAAACCAA5930                           GlnLysSerSerThrSerLeuSerAsnGlyGlyAlaIleGlnAsnGln                               191519201925                                                                   GAGAGTGGCATCAAAGAGTCTGTCTCCTGGGGCACTCAAAGGACCCAG5978                           GluSerGlyIleLysGluSerValSerTrpGlyThrGlnArgThrGln                               1930193519401945                                                               GATGCACCCCATGAGGCCAGGCCACCCCTGGAGCGTGGCCACTCCACA6026                           AspAlaProHisGluAlaArgProProLeuGluArgGlyHisSerThr                               195019551960                                                                   GAGATCCCTGTGGGGCGGTCAGGAGCACTGGCTGTGGACGTTCAGATG6074                           GluIleProValGlyArgSerGlyAlaLeuAlaValAspValGlnMet                               196519701975                                                                   CAGAGCATAACCCGGAGGGGCCCTGATGGGGAGCCCCAGCCTGGGCTG6122                           GlnSerIleThrArgArgGlyProAspGlyGluProGlnProGlyLeu                               198019851990                                                                   GAGAGCCAGGGTCGAGCGGCCTCCATGCCCCGCCTTGCGGCCGAGACT6170                           GluSerGlnGlyArgAlaAlaSerMetProArgLeuAlaAlaGluThr                               199520002005                                                                   CAGCCCGTCACAGATGCCAGCCCCATGAAGCGCTCCATCTCCACGCTG6218                           GlnProValThrAspAlaSerProMetLysArgSerIleSerThrLeu                               2010201520202025                                                               GCCCAGCGGCCCCGTGGGACTCATCTTTGCAGCACCACCCCGGACCGC6266                           AlaGlnArgProArgGlyThrHisLeuCysSerThrThrProAspArg                               203020352040                                                                   CCACCCCCTAGCCAGGCGTCGTCGCACCACCACCACCACCGCTGCCAC6314                           ProProProSerGlnAlaSerSerHisHisHisHisHisArgCysHis                               204520502055                                                                   CGCCGCAGGGACAGGAAGCAGAGGTCCCTGGAGAAGGGGCCCAGCCTG6362                           ArgArgArgAspArgLysGlnArgSerLeuGluLysGlyProSerLeu                               206020652070                                                                   TCTGCCGATATGGATGGCGCACCAAGCAGTGCTGTGGGGCCGGGGCTG6410                           SerAlaAspMetAspGlyAlaProSerSerAlaValGlyProGlyLeu                               207520802085                                                                   CCCCCGGGAGAGGGGCCTACAGGCTGCCGGCGGGAACGAGAGCGCCGG6458                           ProProGlyGluGlyProThrGlyCysArgArgGluArgGluArgArg                               2090209521002105                                                               CAGGAGCGGGGCCGGTCCCAGGAGCGGAGGCAGCCCTCATCCTCCTCC6506                           GlnGluArgGlyArgSerGlnGluArgArgGlnProSerSerSerSer                               211021152120                                                                   TCGGAGAAGCAGCGCTTCTACTCCTGCGACCGCTTTGGGGGCCGTGAG6554                           SerGluLysGlnArgPheTyrSerCysAspArgPheGlyGlyArgGlu                               212521302135                                                                   CCCCCGAAGCCCAAGCCCTCCCTCAGCAGCCACCCAACGTCGCCAACA6602                           ProProLysProLysProSerLeuSerSerHisProThrSerProThr                               214021452150                                                                   GCTGGCCAGGAGCCGGGACCCCACCCACAGGCCGGCTCAGCCGTGGGC6650                           AlaGlyGlnGluProGlyProHisProGlnAlaGlySerAlaValGly                               215521602165                                                                   TTTCCGAACACAACGCCCTGCTGCAGAGAGACCCCCTCAGCCAGCCCC6698                           PheProAsnThrThrProCysCysArgGluThrProSerAlaSerPro                               2170217521802185                                                               TGGCCCCTGGCTCTCGAATTGGCTCTGACCCTTACCTGGGGCAGCGTC6746                           TrpProLeuAlaLeuGluLeuAlaLeuThrLeuThrTrpGlySerVal                               219021952200                                                                   TGGACAGTGAGGCCTCTGTCCACGCCCTGCCTGAGGACACGCTCACTT6794                           TrpThrValArgProLeuSerThrProCysLeuArgThrArgSerLeu                               220522102215                                                                   TCGAGGAGGCTGTGGCCACCAACTCGGGCCGCTCCTCCAGGACTTCCT6842                           SerArgArgLeuTrpProProThrArgAlaAlaProProGlyLeuPro                               222022252230                                                                   ACGTGTCCTCCCTGACCTCCCAGTCTCACCCTCTCCGCCGCGTGCCCAACGG6894                       ThrCysProPro                                                                   2235                                                                           TTACCACTGCACCCTGGGACTCAGCTCGGGTGGCCGAGCACGGCACAGCTACCACCACCC6954               TGACCAAGACCACTGGTGCTAGCTGCACCGTGACCGCTCAGACGCCTGCATGCAGCAGGC7014               GTGTGTTCCAGTGGATGAGTTTTATCATCCACACGGGGCAGTCGGCCCTCGGGGGAGGCC7074               TTGCCCACCTTGGTGAGGCTCCTGTGGCCCCTCCCTCCCCCTCCTCCCCTCTTTTACTCT7134               AGACGACGAATAAAGCCCTGTTGCTTGAGTGTACGTACCGC7175                                  (2) INFORMATION FOR SEQ ID NO:9:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 1546 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 1..1437                                                          (ix) FEATURE:                                                                  (A) NAME/KEY: 3'UTR                                                            (B) LOCATION: 1435..1546                                                       (D) OTHER INFORMATION: /standard_name="Beta 1-2"                               (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:                                        ATGGTCCAGAAGACCAGCATGTCCCGGGGCCCTTACCCACCCTCCCAG48                             MetValGlnLysThrSerMetSerArgGlyProTyrProProSerGln                               151015                                                                         GAGATCCCCATGGAGGTCTTCGACCCCAGCCCGCAGGGCAAATACAGC96                             GluIleProMetGluValPheAspProSerProGlnGlyLysTyrSer                               202530                                                                         AAGAGGAAAGGGCGATTCAAACGGTCAGATGGGAGCACGTCCTCGGAT144                            LysArgLysGlyArgPheLysArgSerAspGlySerThrSerSerAsp                               354045                                                                         ACCACATCCAACAGCTTTGTCCGCCAGGGCTCAGCGGAGTCCTACACC192                            ThrThrSerAsnSerPheValArgGlnGlySerAlaGluSerTyrThr                               505560                                                                         AGCCGTCCATCAGACTCTGATGTATCTCTGGAGGAGGACCGGGAAGCC240                            SerArgProSerAspSerAspValSerLeuGluGluAspArgGluAla                               65707580                                                                       TTAAGGAAGGAAGCAGAGCGCCAGGCATTAGCGCAGCTCGAGAAGGCC288                            LeuArgLysGluAlaGluArgGlnAlaLeuAlaGlnLeuGluLysAla                               859095                                                                         AAGACCAAGCCAGTGGCATTTGCTGTGCGGACAAATGTTGGCTACAAT336                            LysThrLysProValAlaPheAlaValArgThrAsnValGlyTyrAsn                               100105110                                                                      CCGTCTCCAGGGGATGAGGTGCCTGTGCAGGGAGTGGCCATCACCTTC384                            ProSerProGlyAspGluValProValGlnGlyValAlaIleThrPhe                               115120125                                                                      GAGCCCAAAGACTTCCTGCACATCAAGGAGAAATACAATAATGACTGG432                            GluProLysAspPheLeuHisIleLysGluLysTyrAsnAsnAspTrp                               130135140                                                                      TGGATCGGGCGGCTGGTGAAGGAGGGCTGTGAGGTTGGCTTCATTCCC480                            TrpIleGlyArgLeuValLysGluGlyCysGluValGlyPheIlePro                               145150155160                                                                   AGCCCCGTCAAACTGGACAGCCTTCGCCTGCTGCAGGAACAGAAGCTG528                            SerProValLysLeuAspSerLeuArgLeuLeuGlnGluGlnLysLeu                               165170175                                                                      CGCCAGAACCGCCTCGGCTCCAGCAAATCAGGCGATAACTCCAGTTCC576                            ArgGlnAsnArgLeuGlySerSerLysSerGlyAspAsnSerSerSer                               180185190                                                                      AGTCTGGGAGATGTGGTGACTGGCACCCGCCGCCCCACACCCCCTGCC624                            SerLeuGlyAspValValThrGlyThrArgArgProThrProProAla                               195200205                                                                      AGTGCCAAACAGAAGCAGAAGTCGACAGAGCATGTGCCCCCCTATGAC672                            SerAlaLysGlnLysGlnLysSerThrGluHisValProProTyrAsp                               210215220                                                                      GTGGTGCCTTCCATGAGGCCCATCATCCTGGTGGGACCGTCGCTCAAG720                            ValValProSerMetArgProIleIleLeuValGlyProSerLeuLys                               225230235240                                                                   GGCTACGAGGTTACAGACATGATGCAGAAAGCTTTATTTGACTTCTTG768                            GlyTyrGluValThrAspMetMetGlnLysAlaLeuPheAspPheLeu                               245250255                                                                      AAGCATCGGTTTGATGGCAGGATCTCCATCACTCGTGTGACGGCAGAT816                            LysHisArgPheAspGlyArgIleSerIleThrArgValThrAlaAsp                               260265270                                                                      ATTTCCCTGGCTAAGCGCTCAGTTCTCAACAACCCCAGCAAACACATC864                            IleSerLeuAlaLysArgSerValLeuAsnAsnProSerLysHisIle                               275280285                                                                      ATCATTGAGCGCTCCAACACACGCTCCAGCCTGGCTGAGGTGCAGAGT912                            IleIleGluArgSerAsnThrArgSerSerLeuAlaGluValGlnSer                               290295300                                                                      GAAATCGAGCGAATCTTCGAGCTGGCCCGGACCCTTCAGTTGGTCGCT960                            GluIleGluArgIlePheGluLeuAlaArgThrLeuGlnLeuValAla                               305310315320                                                                   CTGGATGCTGACACCATCAATCACCCAGCCCAGCTGTCCAAGACCTCG1008                           LeuAspAlaAspThrIleAsnHisProAlaGlnLeuSerLysThrSer                               325330335                                                                      CTGGCCCCCATCATTGTTTACATCAAGATCACCTCTCCCAAGGTACTT1056                           LeuAlaProIleIleValTyrIleLysIleThrSerProLysValLeu                               340345350                                                                      CAAAGGCTCATCAAGTCCCGAGGAAAGTCTCAGTCCAAACACCTCAAT1104                           GlnArgLeuIleLysSerArgGlyLysSerGlnSerLysHisLeuAsn                               355360365                                                                      GTCCAAATAGCGGCCTCGGAAAAGCTGGCACAGTGCCCCCCTGAAATG1152                           ValGlnIleAlaAlaSerGluLysLeuAlaGlnCysProProGluMet                               370375380                                                                      TTTGACATCATCCTGGATGAGAACCAATTGGAGGATGCCTGCGAGCAT1200                           PheAspIleIleLeuAspGluAsnGlnLeuGluAspAlaCysGluHis                               385390395400                                                                   CTGGCGGAGTACTTGGAAGCCTATTGGAAGGCCACACACCCGCCCAGC1248                           LeuAlaGluTyrLeuGluAlaTyrTrpLysAlaThrHisProProSer                               405410415                                                                      AGCACGCCACCCAATCCGCTGCTGAACCGCACCATGGCTACCGCAGCC1296                           SerThrProProAsnProLeuLeuAsnArgThrMetAlaThrAlaAla                               420425430                                                                      CTGGCTGCCAGCCCTGCCCCTGTCTCCAACCTCCAGGTACAGGTGCTC1344                           LeuAlaAlaSerProAlaProValSerAsnLeuGlnValGlnValLeu                               435440445                                                                      ACCTCGCTCAGGAGAAACCTCGGCTTCTGGGGCGGGCTGGAGTCCTCA1392                           ThrSerLeuArgArgAsnLeuGlyPheTrpGlyGlyLeuGluSerSer                               450455460                                                                      CAGCGGGGCAGTGTGGTGCCCCAGGAGCAGGAACATGCCATGTAGTGGGCGC1444                       GlnArgGlySerValValProGlnGluGlnGluHisAlaMet                                     465470475                                                                      CCTGCCCGTCTTCCCTCCTGCTCTGGGGTCGGAACTGGAGTGCAGGGAACATGGAGGAGG1504               AAGGGAAGAGCTTTATTTTGTAAAAAAATAAGATGAGCGGCA1546                                 (2) INFORMATION FOR SEQ ID NO:10:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 1851 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 1..1797                                                          (D) OTHER INFORMATION: /standard_name="Beta1-3"                                (ix) FEATURE:                                                                  (A) NAME/KEY: 3'UTR                                                            (B) LOCATION: 1795..1851                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:                                       ATGGTCCAGAAGACCAGCATGTCCCGGGGCCCTTACCCACCCTCCCAG48                             MetValGlnLysThrSerMetSerArgGlyProTyrProProSerGln                               151015                                                                         GAGATCCCCATGGGAGTCTTCGACCCCAGCCCGCAGGGCAAATACAGC96                             GluIleProMetGlyValPheAspProSerProGlnGlyLysTyrSer                               202530                                                                         AAGAGGAAAGGGCGATTCAAACGGTCAGATGGGAGCACGTCCTCGGAT144                            LysArgLysGlyArgPheLysArgSerAspGlySerThrSerSerAsp                               354045                                                                         ACCACATCCAACAGCTTTGTCCGCCAGGGCTCAGCGGAGTCCTACACC192                            ThrThrSerAsnSerPheValArgGlnGlySerAlaGluSerTyrThr                               505560                                                                         AGCCGTCCATCAGACTCTGATGTATCTCTGGAGGAGGACCGGGAAGCC240                            SerArgProSerAspSerAspValSerLeuGluGluAspArgGluAla                               65707580                                                                       TTAAGGAAGGAAGCAGAGCGCCAGGCATTAGCGCAGCTCGAGAAGGCC288                            LeuArgLysGluAlaGluArgGlnAlaLeuAlaGlnLeuGluLysAla                               859095                                                                         AAGACCAAGCCAGTGGCATTTGCTGTGCGGACAAATGTTGGCTACAAT336                            LysThrLysProValAlaPheAlaValArgThrAsnValGlyTyrAsn                               100105110                                                                      CCGTCTCCAGGGGATGAGGTGCCTGTGCAGGGAGTGGCCATCACCTTC384                            ProSerProGlyAspGluValProValGlnGlyValAlaIleThrPhe                               115120125                                                                      GAGCCCAAAGACTTCCTGCACATCAAGGAGAAATACAATAATGACTGG432                            GluProLysAspPheLeuHisIleLysGluLysTyrAsnAsnAspTrp                               130135140                                                                      TGGATCGGGCGGCTGGTGAAGGAGGGCTGTGAGGTTGGCTTCATTCCC480                            TrpIleGlyArgLeuValLysGluGlyCysGluValGlyPheIlePro                               145150155160                                                                   AGCCCCGTCAAACTGGACAGCCTTCGCCTGCTGCAGGAACAGAAGCTG528                            SerProValLysLeuAspSerLeuArgLeuLeuGlnGluGlnLysLeu                               165170175                                                                      CGCCAGAACCGCCTCGGCTCCAGCAAATCAGGCGATAACTCCAGTTCC576                            ArgGlnAsnArgLeuGlySerSerLysSerGlyAspAsnSerSerSer                               180185190                                                                      AGTCTGGGAGATGTGGTGACTGGCACCCGCCGCCCCACACCCCCTGCC624                            SerLeuGlyAspValValThrGlyThrArgArgProThrProProAla                               195200205                                                                      AGTGCCAAACAGAAGCAGAAGTCGACAGAGCATGTGCCCCCCTATGAC672                            SerAlaLysGlnLysGlnLysSerThrGluHisValProProTyrAsp                               210215220                                                                      GTGGTGCCTTCCATGAGGCCCATCATCCTGGTGGGACCGTCGCTCAAG720                            ValValProSerMetArgProIleIleLeuValGlyProSerLeuLys                               225230235240                                                                   GGCTACGAGGTTACAGACATGATGCAGAAAGCTTTATTTGACTTCTTG768                            GlyTyrGluValThrAspMetMetGlnLysAlaLeuPheAspPheLeu                               245250255                                                                      AAGCATCGGTTTGATGGCAGGATCTCCATCACTCGTGTGACGGCAGAT816                            LysHisArgPheAspGlyArgIleSerIleThrArgValThrAlaAsp                               260265270                                                                      ATTTCCCTGGCTAAGCGCTCAGTTCTCAACAACCCCAGCAAACACATC864                            IleSerLeuAlaLysArgSerValLeuAsnAsnProSerLysHisIle                               275280285                                                                      ATCATTGAGCGCTCCAACACACGCTCCAGCCTGGCTGAGGTGCAGAGT912                            IleIleGluArgSerAsnThrArgSerSerLeuAlaGluValGlnSer                               290295300                                                                      GAAATCGAGCGAATCTTCGAGCTGGCCCGGACCCTTCAGTTGGTCGCT960                            GluIleGluArgIlePheGluLeuAlaArgThrLeuGlnLeuValAla                               305310315320                                                                   CTGGATGCTGACACCATCAATCACCCAGCCCAGCTGTCCAAGACCTCG1008                           LeuAspAlaAspThrIleAsnHisProAlaGlnLeuSerLysThrSer                               325330335                                                                      CTGGCCCCCATCATTGTTTACATCAAGATCACCTCTCCCAAGGTACTT1056                           LeuAlaProIleIleValTyrIleLysIleThrSerProLysValLeu                               340345350                                                                      CAAAGGCTCATCAAGTCCCGAGGAAAGTCTCAGTCCAAACACCTCAAT1104                           GlnArgLeuIleLysSerArgGlyLysSerGlnSerLysHisLeuAsn                               355360365                                                                      GTCCAAATAGCGGCCTCGGAAAAGCTGGCACAGTGCCCCCCTGAAATG1152                           ValGlnIleAlaAlaSerGluLysLeuAlaGlnCysProProGluMet                               370375380                                                                      TTTGACATCATCCTGGATGAGAACCAATTGGAGGATGCCTGCGAGCAT1200                           PheAspIleIleLeuAspGluAsnGlnLeuGluAspAlaCysGluHis                               385390395400                                                                   CTGGCGGAGTACTTGGAAGCCTATTGGAAGGCCACACACCCGCCCAGC1248                           LeuAlaGluTyrLeuGluAlaTyrTrpLysAlaThrHisProProSer                               405410415                                                                      AGCACGCCACCCAATCCGCTGCTGAACCGCACCATGGCTACCGCAGCC1296                           SerThrProProAsnProLeuLeuAsnArgThrMetAlaThrAlaAla                               420425430                                                                      CTGGCTGCCAGCCCTGCCCCTGTCTCCAACCTCCAGGGACCCTACCTT1344                           LeuAlaAlaSerProAlaProValSerAsnLeuGlnGlyProTyrLeu                               435440445                                                                      GCTTCCGGGGACCAGCCACTGGAACGGGCCACCGGGGAGCACGCCAGC1392                           AlaSerGlyAspGlnProLeuGluArgAlaThrGlyGluHisAlaSer                               450455460                                                                      ATGCACGAGTACCCAGGGGAGCTGGGCCAGCCCCCAGGCCTTTACCCC1440                           MetHisGluTyrProGlyGluLeuGlyGlnProProGlyLeuTyrPro                               465470475480                                                                   AGCAGCCACCCACCAGGCCGGGCAGGCACGCTACGGGCACTGTCCCGC1488                           SerSerHisProProGlyArgAlaGlyThrLeuArgAlaLeuSerArg                               485490495                                                                      CAAGACACTTTTGATGCCGACACCCCCGGCAGCCGAAACTCTGCCTAC1536                           GlnAspThrPheAspAlaAspThrProGlySerArgAsnSerAlaTyr                               500505510                                                                      ACGGAGCTGGGAGACTCATGTGTGGACATGGAGACTGACCCCTCAGAG1584                           ThrGluLeuGlyAspSerCysValAspMetGluThrAspProSerGlu                               515520525                                                                      GGGCCAGGGCTTGGAGACCCTGCAGGGGGCGGCACGCCCCCAGCCCGA1632                           GlyProGlyLeuGlyAspProAlaGlyGlyGlyThrProProAlaArg                               530535540                                                                      CAGGGATCCTGGGAGGACGAGGAAGAAGACTATGAGGAAGAGCTGACC1680                           GlnGlySerTrpGluAspGluGluGluAspTyrGluGluGluLeuThr                               545550555560                                                                   GACAACCGGAACCGGGGCCGGAATAAGGCCCGCTACTGCGCTGAGGGT1728                           AspAsnArgAsnArgGlyArgAsnLysAlaArgTyrCysAlaGluGly                               565570575                                                                      GGGGGTCCAGTTTTGGGGCGCAACAAGAATGAGCTGGAGGGCTGGGGA1776                           GlyGlyProValLeuGlyArgAsnLysAsnGluLeuGluGlyTrpGly                               580585590                                                                      CGAGGCGTCTACATTCGCTGAGAGGCAGGGGCCACACGGCGGGAGGAA1824                           ArgGlyValTyrIleArg                                                             595                                                                            GGGCTCTGAGCCCAGGGGAGGGGAGGG1851                                                (2) INFORMATION FOR SEQ ID NO:11:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 3600 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 35..3310                                                         (D) OTHER INFORMATION: /standard_name="Alpha-2b"                               (ix) FEATURE:                                                                  (A) NAME/KEY: 5'UTR                                                            (B) LOCATION: 1..34                                                            (ix) FEATURE:                                                                  (A) NAME/KEY: 3'UTR                                                            (B) LOCATION: 3308..3600                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:                                       GCGGGGGAGGGGGCATTGATCTTCGATCGCGAAGATGGCTGCTGGCTGCCTG52                         MetAlaAlaGlyCysLeu                                                             15                                                                             CTGGCCTTGACTCTGACACTTTTCCAATCTTTGCTCATCGGCCCCTCG100                            LeuAlaLeuThrLeuThrLeuPheGlnSerLeuLeuIleGlyProSer                               101520                                                                         TCGGAGGAGCCGTTCCCTTCGGCCGTCACTATCAAATCATGGGTGGAT148                            SerGluGluProPheProSerAlaValThrIleLysSerTrpValAsp                               253035                                                                         AAGATGCAAGAAGACCTTGTCACACTGGCAAAAACAGCAAGTGGAGTC196                            LysMetGlnGluAspLeuValThrLeuAlaLysThrAlaSerGlyVal                               404550                                                                         AATCAGCTTGTTGATATTTATGAGAAATATCAAGATTTGTATACTGTG244                            AsnGlnLeuValAspIleTyrGluLysTyrGlnAspLeuTyrThrVal                               55606570                                                                       GAACCAAATAATGCACGCCAGCTGGTAGAAATTGCAGCCAGGGATATT292                            GluProAsnAsnAlaArgGlnLeuValGluIleAlaAlaArgAspIle                               758085                                                                         GAGAAACTTCTGAGCAACAGATCTAAAGCCCTGGTGAGCCTGGCATTG340                            GluLysLeuLeuSerAsnArgSerLysAlaLeuValSerLeuAlaLeu                               9095100                                                                        GAAGCGGAGAAAGTTCAAGCAGCTCACCAGTGGAGAGAAGATTTTGCA388                            GluAlaGluLysValGlnAlaAlaHisGlnTrpArgGluAspPheAla                               105110115                                                                      AGCAATGAAGTTGTCTACTACAATGCAAAGGATGATCTCGATCCTGAG436                            SerAsnGluValValTyrTyrAsnAlaLysAspAspLeuAspProGlu                               120125130                                                                      AAAAATGACAGTGAGCCAGGCAGCCAGAGGATAAAACCTGTTTTCATT484                            LysAsnAspSerGluProGlySerGlnArgIleLysProValPheIle                               135140145150                                                                   GAAGATGCTAATTTTGGACGACAAATATCTTATCAGCACGCAGCAGTC532                            GluAspAlaAsnPheGlyArgGlnIleSerTyrGlnHisAlaAlaVal                               155160165                                                                      CATATTCCTACTGACATCTATGAGGGCTCAACAATTGTGTTAAATGAA580                            HisIleProThrAspIleTyrGluGlySerThrIleValLeuAsnGlu                               170175180                                                                      CTCAACTGGACAAGTGCCTTAGATGAAGTTTTCAAAAAGAATCGCGAG628                            LeuAsnTrpThrSerAlaLeuAspGluValPheLysLysAsnArgGlu                               185190195                                                                      GAAGACCCTTCATTATTGTGGCAGGTTTTTGGCAGTGCCACTGGCCTA676                            GluAspProSerLeuLeuTrpGlnValPheGlySerAlaThrGlyLeu                               200205210                                                                      GCTCGATATTATCCAGCTTCACCATGGGTTGATAATAGTAGAACTCCA724                            AlaArgTyrTyrProAlaSerProTrpValAspAsnSerArgThrPro                               215220225230                                                                   AATAAGATTGACCTTTATGATGTACGCAGAAGACCATGGTACATCCAA772                            AsnLysIleAspLeuTyrAspValArgArgArgProTrpTyrIleGln                               235240245                                                                      GGAGCTGCATCTCCTAAAGACATGCTTATTCTGGTGGATGTGAGTGGA820                            GlyAlaAlaSerProLysAspMetLeuIleLeuValAspValSerGly                               250255260                                                                      AGTGTTAGTGGATTGACACTTAAACTGATCCGAACATCTGTCTCCGAA868                            SerValSerGlyLeuThrLeuLysLeuIleArgThrSerValSerGlu                               265270275                                                                      ATGTTAGAAACCCTCTCAGATGATGATTTCGTGAATGTAGCTTCATTT916                            MetLeuGluThrLeuSerAspAspAspPheValAsnValAlaSerPhe                               280285290                                                                      AACAGCAATGCTCAGGATGTAAGCTGTTTTCAGCACCTTGTCCAAGCA964                            AsnSerAsnAlaGlnAspValSerCysPheGlnHisLeuValGlnAla                               295300305310                                                                   AATGTAAGAAATAAAAAAGTGTTGAAAGACGCGGTGAATAATATCACA1012                           AsnValArgAsnLysLysValLeuLysAspAlaValAsnAsnIleThr                               315320325                                                                      GCCAAAGGAATTACAGATTATAAGAAGGGCTTTAGTTTTGCTTTTGAA1060                           AlaLysGlyIleThrAspTyrLysLysGlyPheSerPheAlaPheGlu                               330335340                                                                      CAGCTGCTTAATTATAATGTTTCCAGAGCAAACTGCAATAAGATTATT1108                           GlnLeuLeuAsnTyrAsnValSerArgAlaAsnCysAsnLysIleIle                               345350355                                                                      ATGCTATTCACGGATGGAGGAGAAGAGAGAGCCCAGGAGATATTTAAC1156                           MetLeuPheThrAspGlyGlyGluGluArgAlaGlnGluIlePheAsn                               360365370                                                                      AAATACAATAAAGATAAAAAAGTACGTGTATTCAGGTTTTCAGTTGGT1204                           LysTyrAsnLysAspLysLysValArgValPheArgPheSerValGly                               375380385390                                                                   CAACACAATTATGAGAGAGGACCTATTCAGTGGATGGCCTGTGAAAAC1252                           GlnHisAsnTyrGluArgGlyProIleGlnTrpMetAlaCysGluAsn                               395400405                                                                      AAAGGTTATTATTATGAAATTCCTTCCATTGGTGCAATAAGAATCAAT1300                           LysGlyTyrTyrTyrGluIleProSerIleGlyAlaIleArgIleAsn                               410415420                                                                      ACTCAGGAATATTTGGATGTTTTGGGAAGACCAATGGTTTTAGCAGGA1348                           ThrGlnGluTyrLeuAspValLeuGlyArgProMetValLeuAlaGly                               425430435                                                                      GACAAAGCTAAGCAAGTCCAATGGACAAATGTGTACCTGGATGCATTG1396                           AspLysAlaLysGlnValGlnTrpThrAsnValTyrLeuAspAlaLeu                               440445450                                                                      GAACTGGGACTTGTCATTACTGGAACTCTTCCGGTCTTCAACATAACC1444                           GluLeuGlyLeuValIleThrGlyThrLeuProValPheAsnIleThr                               455460465470                                                                   GGCCAATTTGAAAATAAGACAAACTTAAAGAACCAGCTGATTCTTGGT1492                           GlyGlnPheGluAsnLysThrAsnLeuLysAsnGlnLeuIleLeuGly                               475480485                                                                      GTGATGGGAGTAGATGTGTCTTTGGAAGATATTAAAAGACTGACACCA1540                           ValMetGlyValAspValSerLeuGluAspIleLysArgLeuThrPro                               490495500                                                                      CGTTTTACACTGTGCCCCAATGGGTATTACTTTGCAATCGATCCTAAT1588                           ArgPheThrLeuCysProAsnGlyTyrTyrPheAlaIleAspProAsn                               505510515                                                                      GGTTATGTTTTATTACATCCAAATCTTCAGCCAAAGAACCCCAAATCT1636                           GlyTyrValLeuLeuHisProAsnLeuGlnProLysAsnProLysSer                               520525530                                                                      CAGGAGCCAGTAACATTGGATTTCCTTGATGCAGAGTTAGAGAATGAT1684                           GlnGluProValThrLeuAspPheLeuAspAlaGluLeuGluAsnAsp                               535540545550                                                                   ATTAAAGTGGAGATTCGAAATAAGATGATTGATGGGGAAAGTGGAGAA1732                           IleLysValGluIleArgAsnLysMetIleAspGlyGluSerGlyGlu                               555560565                                                                      AAAACATTCAGAACTCTGGTTAAATCTCAAGATGAGAGATATATTGAC1780                           LysThrPheArgThrLeuValLysSerGlnAspGluArgTyrIleAsp                               570575580                                                                      AAAGGAAACAGGACATACACATGGACACCTGTCAATGGCACAGATTAC1828                           LysGlyAsnArgThrTyrThrTrpThrProValAsnGlyThrAspTyr                               585590595                                                                      AGTTTGGCCTTGGTATTACCAACCTACAGTTTTTACTATATAAAAGCC1876                           SerLeuAlaLeuValLeuProThrTyrSerPheTyrTyrIleLysAla                               600605610                                                                      AAACTAGAAGAGACAATAACTCAGGCCAGATCAAAAAAGGGCAAAATG1924                           LysLeuGluGluThrIleThrGlnAlaArgSerLysLysGlyLysMet                               615620625630                                                                   AAGGATTCGGAAACCCTGAAGCCAGATAATTTTGAAGAATCTGGCTAT1972                           LysAspSerGluThrLeuLysProAspAsnPheGluGluSerGlyTyr                               635640645                                                                      ACATTCATAGCACCAAGAGATTACTGCAATGACCTGAAAATATCGGAT2020                           ThrPheIleAlaProArgAspTyrCysAsnAspLeuLysIleSerAsp                               650655660                                                                      AATAACACTGAATTTCTTTTAAATTTCAACGAGTTTATTGATAGAAAA2068                           AsnAsnThrGluPheLeuLeuAsnPheAsnGluPheIleAspArgLys                               665670675                                                                      ACTCCAAACAACCCATCATGTAACGCGGATTTGATTAATAGAGTCTTG2116                           ThrProAsnAsnProSerCysAsnAlaAspLeuIleAsnArgValLeu                               680685690                                                                      CTTGATGCAGGCTTTACAAATGAACTTGTCCAAAATTACTGGAGTAAG2164                           LeuAspAlaGlyPheThrAsnGluLeuValGlnAsnTyrTrpSerLys                               695700705710                                                                   CAGAAAAATATCAAGGGAGTGAAAGCACGATTTGTTGTGACTGATGGT2212                           GlnLysAsnIleLysGlyValLysAlaArgPheValValThrAspGly                               715720725                                                                      GGGATTACCAGAGTTTATCCCAAAGAGGCTGGAGAAAATTGGCAAGAA2260                           GlyIleThrArgValTyrProLysGluAlaGlyGluAsnTrpGlnGlu                               730735740                                                                      AACCCAGAGACATATGAGGACAGCTTCTATAAAAGGAGCCTAGATAAT2308                           AsnProGluThrTyrGluAspSerPheTyrLysArgSerLeuAspAsn                               745750755                                                                      GATAACTATGTTTTCACTGCTCCCTACTTTAACAAAAGTGGACCTGGT2356                           AspAsnTyrValPheThrAlaProTyrPheAsnLysSerGlyProGly                               760765770                                                                      GCCTATGAATCGGGCATTATGGTAAGCAAAGCTGTAGAAATATATATT2404                           AlaTyrGluSerGlyIleMetValSerLysAlaValGluIleTyrIle                               775780785790                                                                   CAAGGGAAACTTCTTAAACCTGCAGTTGTTGGAATTAAAATTGATGTA2452                           GlnGlyLysLeuLeuLysProAlaValValGlyIleLysIleAspVal                               795800805                                                                      AATTCCTGGATAGAGAATTTCACCAAAACCTCAATCAGAGATCCGTGT2500                           AsnSerTrpIleGluAsnPheThrLysThrSerIleArgAspProCys                               810815820                                                                      GCTGGTCCAGTTTGTGACTGCAAAAGAAACAGTGACGTAATGGATTGT2548                           AlaGlyProValCysAspCysLysArgAsnSerAspValMetAspCys                               825830835                                                                      GTGATTCTGGATGATGGTGGGTTTCTTCTGATGGCAAATCATGATGAT2596                           ValIleLeuAspAspGlyGlyPheLeuLeuMetAlaAsnHisAspAsp                               840845850                                                                      TATACTAATCAGATTGGAAGATTTTTTGGAGAGATTGATCCCAGCTTG2644                           TyrThrAsnGlnIleGlyArgPhePheGlyGluIleAspProSerLeu                               855860865870                                                                   ATGAGACACCTGGTTAATATATCAGTTTATGCTTTTAACAAATCTTAT2692                           MetArgHisLeuValAsnIleSerValTyrAlaPheAsnLysSerTyr                               875880885                                                                      GATTATCAGTCAGTATGTGAGCCCGGTGCTGCACCAAAACAAGGAGCA2740                           AspTyrGlnSerValCysGluProGlyAlaAlaProLysGlnGlyAla                               890895900                                                                      GGACATCGCTCAGCATATGTGCCATCAGTAGCAGACATATTACAAATT2788                           GlyHisArgSerAlaTyrValProSerValAlaAspIleLeuGlnIle                               905910915                                                                      GGCTGGTGGGCCACTGCTGCTGCCTGGTCTATTCTACAGCAGTTTCTC2836                           GlyTrpTrpAlaThrAlaAlaAlaTrpSerIleLeuGlnGlnPheLeu                               920925930                                                                      TTGAGTTTGACCTTTCCACGACTCCTTGAGGCAGTTGAGATGGAGGAT2884                           LeuSerLeuThrPheProArgLeuLeuGluAlaValGluMetGluAsp                               935940945950                                                                   GATGACTTCACGGCCTCCCTGTCCAAGCAGAGCTGCATTACTGAACAA2932                           AspAspPheThrAlaSerLeuSerLysGlnSerCysIleThrGluGln                               955960965                                                                      ACCCAGTATTTCTTCGATAACGACAGTAAATCATTCAGTGGTGTATTA2980                           ThrGlnTyrPhePheAspAsnAspSerLysSerPheSerGlyValLeu                               970975980                                                                      GACTGTGGAAACTGTTCCAGAATCTTTCATGGAGAAAAGCTTATGAAC3028                           AspCysGlyAsnCysSerArgIlePheHisGlyGluLysLeuMetAsn                               985990995                                                                      ACCAACTTAATATTCATAATGGTTGAGAGCAAAGGGACATGTCCATGT3076                           ThrAsnLeuIlePheIleMetValGluSerLysGlyThrCysProCys                               100010051010                                                                   GACACACGACTGCTCATACAAGCGGAGCAGACTTCTGACGGTCCAAAT3124                           AspThrArgLeuLeuIleGlnAlaGluGlnThrSerAspGlyProAsn                               1015102010251030                                                               CCTTGTGACATGGTTAAGCAACCTAGATACCGAAAAGGGCCTGATGTC3172                           ProCysAspMetValLysGlnProArgTyrArgLysGlyProAspVal                               103510401045                                                                   TGCTTTGATAACAATGTCTTGGAGGATTATACTGACTGTGGTGGTGTT3220                           CysPheAspAsnAsnValLeuGluAspTyrThrAspCysGlyGlyVal                               105010551060                                                                   TCTGGATTAAATCCCTCCCTGTGGTATATCATTGGAATCCAGTTTCTA3268                           SerGlyLeuAsnProSerLeuTrpTyrIleIleGlyIleGlnPheLeu                               106510701075                                                                   CTACTTTGGCTGGTATCTGGCAGCACACACCGGCTGTTATGACCTTCTA3317                          LeuLeuTrpLeuValSerGlySerThrHisArgLeuLeu                                        108010851090                                                                   AAAACCAAATCTGCATAGTTAAACTCCAGACCCTGCCAAAACATGAGCCCTGCCCTCAAT3377               TACAGTAACGTAGGGTCAGCTATAAAATCAGACAAACATTAGCTGGGCCTGTTCCATGGC3437               ATAACACTAAGGCGCAGACTCCTAAGGCACCCACTGGCTGCATGTCAGGGTGTCAGATCC3497               TTAAACGTGTGTGAATGCTGCATCATCTATGTGTAACATCAAAGCAAAATCCTATACGTG3557               TCCTCTATTGGAAAATTTGGGCGTTTGTTGTTGCATTGTTGGT3600                                (2) INFORMATION FOR SEQ ID NO:12:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 323 base pairs                                                     (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:                                       CCCCCTGCCAGTGGCCAAACAGAAGCAGAAGTCGGGTAATGAAATGACTAACTTAGCCTT60                 TGAACTAGACCCCCTAGAGTTAGAGGAGGAAGAGGCTGAGCTTGGTGAGCAGAGTGGCTC120                TGCCAAGACTAGTGTTAGCAGTGTCACCACCCCGCCACCCCATGGCAAACGCATCCCCTT180                CTTTAAGAAGACAGAGCATGTGCCCCCCTATGACGTGGTGCCTTCCATGAGGCCCATCAT240                CCTGGTGGGACCGTCGCTCAAGGGCTACGAGGTTACAGACATGATGCAGAAAGCTTTATT300                TGACTTCTTGAAGCATCGGTTTG323                                                     (2) INFORMATION FOR SEQ ID NO:13:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 57 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:                                       CCTATTGGTGTAGGTATACCAACAATTAATTTAAGAAAAAGGAGACCCAATATCCAG57                    (2) INFORMATION FOR SEQ ID NO:14:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 180 base pairs                                                     (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 1..132                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:                                       TGGTCCTTTGCCTGCGCCTGTGCCGCCTTCATCCTCCTCTTTCTCGGC48                             TrpSerPheAlaCysAlaCysAlaAlaPheIleLeuLeuPheLeuGly                               151015                                                                         GGTCTCGCCCTCCTGCTGTTCTCCCTGCCTCGAATGCCCCGGAACCCA96                             GlyLeuAlaLeuLeuLeuPheSerLeuProArgMetProArgAsnPro                               202530                                                                         TGGGAGTCCTGCATGGATGCTGAGCCCGAGCACTAACCCTCCTGCGGCCCTAG149                       TrpGluSerCysMetAspAlaGluProGluHis                                              3540                                                                           CGACCCTCAGGCTTCTTCCCAGGAAGCGGGG180                                             (2) INFORMATION FOR SEQ ID NO:15:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 22 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: Other nucleic acid;                                        (A) DESCRIPTION: Oligonucleotide                                               (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:                                       AATTCGGTACGTACACTCGAGC22                                                       (2) INFORMATION FOR SEQ ID NO:16:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 18 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: Other nucleic acid;                                        (A) DESCRIPTION: Oligonucleotide                                               (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:                                       GCTCGAGTGTACGTACCG18                                                           (2) INFORMATION FOR SEQ ID NO:17:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 20 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: Other nucleic acid;                                        (A) DESCRIPTION: Oligonucleotide                                               (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:                                       CCATGGTACCTTCGTTGACG20                                                         (2) INFORMATION FOR SEQ ID NO:18:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 24 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: Other nucleic acid;                                        (A) DESCRIPTION: Oligonucleotide                                               (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:                                       AATTCGTCAACGAAGGTACCATGG24                                                     (2) INFORMATION FOR SEQ ID NO:19:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 249 base pairs                                                     (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:                                       CGGTACGTACACTCGAGCGACTGTGTCATGATGTTTATTAGTGATGACAGTGAGGTGAGG60                 CAGGGGCTTGTGGAGCATGCTCTGTAGGTCACACACTAGAGCCATAAGGCAAGAGTAGGC120                GGGGAGACAGGTCCTCTGTGCCCTGTCTCTCCCCATCTAACCCTAACCTAACAAGCGGTA180                GTTATGAGTCAGGGAACAACGTCTGGAGCCCCGTCCTCCAAAGATGTTTGAGGGACAAGA240                ACAGAAATG249                                                                   (2) INFORMATION FOR SEQ ID NO:20:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 402 base pairs                                                     (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:                                       CCACCTAGCACGGGTTCGTTCCCCTCTCCCGGCCTGGCCCGGGCTCCCCGGTGGCCGCCG60                 CCCCCTCGCCGCCCATCTCTGGACTCAGACGTCTCCCTGGAGGAGGACCGGGAGAGTGCC120                CGGCGTGAAGTAGAGAGCCAGGCTCAGCAGCAGCTCGAAAGGGCCAAGCACAAACCTGTG180                GCATTTGCGGTGAGGACCAATGTCAGCTACTGTGGCGTACTGGATGAGGAGTGCCCAGTC240                CAGGGCTCTGGAGTCAACTTTGAGGCCAAAGATTTTCTGCACATTAAAGAGAAGTACAGC300                AATGACTGGTGGATCGGGCGGCTAGTGAAAGAGGGCGGGGACATCGCCTTCATCCCCAGC360                CCCCAGTGCCTGGTGAGCATCCGCTCAAACAGGAGCAGAAGG402                                  (2) INFORMATION FOR SEQ ID NO:21:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 28 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: Other nucleic acid;                                        (A) DESCRIPTION: Oligonucleotide                                               (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:                                       CTCAGTACCATCTCTGATACCAGCCCCA28                                                 (2) INFORMATION FOR SEQ ID NO:22:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 3636 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 35..3346                                                         (D) OTHER INFORMATION: /standard_name="Alpha-2a"                               (ix) FEATURE:                                                                  (A) NAME/KEY: 5'UTR                                                            (B) LOCATION: 1..34                                                            (ix) FEATURE:                                                                  (A) NAME/KEY: 3'UTR                                                            (B) LOCATION: 3347..3636                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:                                       GCGGGGGAGGGGGCATTGATCTTCGATCGCGAAGATGGCTGCTGGCTGCCTG52                         MetAlaAlaGlyCysLeu                                                             15                                                                             CTGGCCTTGACTCTGACACTTTTCCAATCTTTGCTCATCGGCCCCTCG100                            LeuAlaLeuThrLeuThrLeuPheGlnSerLeuLeuIleGlyProSer                               101520                                                                         TCGGAGGAGCCGTTCCCTTCGGCCGTCACTATCAAATCATGGGTGGAT148                            SerGluGluProPheProSerAlaValThrIleLysSerTrpValAsp                               253035                                                                         AAGATGCAAGAAGACCTTGTCACACTGGCAAAAACAGCAAGTGGAGTC196                            LysMetGlnGluAspLeuValThrLeuAlaLysThrAlaSerGlyVal                               404550                                                                         AATCAGCTTGTTGATATTTATGAGAAATATCAAGATTTGTATACTGTG244                            AsnGlnLeuValAspIleTyrGluLysTyrGlnAspLeuTyrThrVal                               55606570                                                                       GAACCAAATAATGCACGCCAGCTGGTAGAAATTGCAGCCAGGGATATT292                            GluProAsnAsnAlaArgGlnLeuValGluIleAlaAlaArgAspIle                               758085                                                                         GAGAAACTTCTGAGCAACAGATCTAAAGCCCTGGTGAGCCTGGCATTG340                            GluLysLeuLeuSerAsnArgSerLysAlaLeuValSerLeuAlaLeu                               9095100                                                                        GAAGCGGAGAAAGTTCAAGCAGCTCACCAGTGGAGAGAAGATTTTGCA388                            GluAlaGluLysValGlnAlaAlaHisGlnTrpArgGluAspPheAla                               105110115                                                                      AGCAATGAAGTTGTCTACTACAATGCAAAGGATGATCTCGATCCTGAG436                            SerAsnGluValValTyrTyrAsnAlaLysAspAspLeuAspProGlu                               120125130                                                                      AAAAATGACAGTGAGCCAGGCAGCCAGAGGATAAAACCTGTTTTCATT484                            LysAsnAspSerGluProGlySerGlnArgIleLysProValPheIle                               135140145150                                                                   GAAGATGCTAATTTTGGACGACAAATATCTTATCAGCACGCAGCAGTC532                            GluAspAlaAsnPheGlyArgGlnIleSerTyrGlnHisAlaAlaVal                               155160165                                                                      CATATTCCTACTGACATCTATGAGGGCTCAACAATTGTGTTAAATGAA580                            HisIleProThrAspIleTyrGluGlySerThrIleValLeuAsnGlu                               170175180                                                                      CTCAACTGGACAAGTGCCTTAGATGAAGTTTTCAAAAAGAATCGCGAG628                            LeuAsnTrpThrSerAlaLeuAspGluValPheLysLysAsnArgGlu                               185190195                                                                      GAAGACCCTTCATTATTGTGGCAGGTTTTTGGCAGTGCCACTGGCCTA676                            GluAspProSerLeuLeuTrpGlnValPheGlySerAlaThrGlyLeu                               200205210                                                                      GCTCGATATTATCCAGCTTCACCATGGGTTGATAATAGTAGAACTCCA724                            AlaArgTyrTyrProAlaSerProTrpValAspAsnSerArgThrPro                               215220225230                                                                   AATAAGATTGACCTTTATGATGTACGCAGAAGACCATGGTACATCCAA772                            AsnLysIleAspLeuTyrAspValArgArgArgProTrpTyrIleGln                               235240245                                                                      GGAGCTGCATCTCCTAAAGACATGCTTATTCTGGTGGATGTGAGTGGA820                            GlyAlaAlaSerProLysAspMetLeuIleLeuValAspValSerGly                               250255260                                                                      AGTGTTAGTGGATTGACACTTAAACTGATCCGAACATCTGTCTCCGAA868                            SerValSerGlyLeuThrLeuLysLeuIleArgThrSerValSerGlu                               265270275                                                                      ATGTTAGAAACCCTCTCAGATGATGATTTCGTGAATGTAGCTTCATTT916                            MetLeuGluThrLeuSerAspAspAspPheValAsnValAlaSerPhe                               280285290                                                                      AACAGCAATGCTCAGGATGTAAGCTGTTTTCAGCACCTTGTCCAAGCA964                            AsnSerAsnAlaGlnAspValSerCysPheGlnHisLeuValGlnAla                               295300305310                                                                   AATGTAAGAAATAAAAAAGTGTTGAAAGACGCGGTGAATAATATCACA1012                           AsnValArgAsnLysLysValLeuLysAspAlaValAsnAsnIleThr                               315320325                                                                      GCCAAAGGAATTACAGATTATAAGAAGGGCTTTAGTTTTGCTTTTGAA1060                           AlaLysGlyIleThrAspTyrLysLysGlyPheSerPheAlaPheGlu                               330335340                                                                      CAGCTGCTTAATTATAATGTTTCCAGAGCAAACTGCAATAAGATTATT1108                           GlnLeuLeuAsnTyrAsnValSerArgAlaAsnCysAsnLysIleIle                               345350355                                                                      ATGCTATTCACGGATGGAGGAGAAGAGAGAGCCCAGGAGATATTTAAC1156                           MetLeuPheThrAspGlyGlyGluGluArgAlaGlnGluIlePheAsn                               360365370                                                                      AAATACAATAAAGATAAAAAAGTACGTGTATTCAGGTTTTCAGTTGGT1204                           LysTyrAsnLysAspLysLysValArgValPheArgPheSerValGly                               375380385390                                                                   CAACACAATTATGAGAGAGGACCTATTCAGTGGATGGCCTGTGAAAAC1252                           GlnHisAsnTyrGluArgGlyProIleGlnTrpMetAlaCysGluAsn                               395400405                                                                      AAAGGTTATTATTATGAAATTCCTTCCATTGGTGCAATAAGAATCAAT1300                           LysGlyTyrTyrTyrGluIleProSerIleGlyAlaIleArgIleAsn                               410415420                                                                      ACTCAGGAATATTTGGATGTTTTGGGAAGACCAATGGTTTTAGCAGGA1348                           ThrGlnGluTyrLeuAspValLeuGlyArgProMetValLeuAlaGly                               425430435                                                                      GACAAAGCTAAGCAAGTCCAATGGACAAATGTGTACCTGGATGCATTG1396                           AspLysAlaLysGlnValGlnTrpThrAsnValTyrLeuAspAlaLeu                               440445450                                                                      GAACTGGGACTTGTCATTACTGGAACTCTTCCGGTCTTCAACATAACC1444                           GluLeuGlyLeuValIleThrGlyThrLeuProValPheAsnIleThr                               455460465470                                                                   GGCCAATTTGAAAATAAGACAAACTTAAAGAACCAGCTGATTCTTGGT1492                           GlyGlnPheGluAsnLysThrAsnLeuLysAsnGlnLeuIleLeuGly                               475480485                                                                      GTGATGGGAGTAGATGTGTCTTTGGAAGATATTAAAAGACTGACACCA1540                           ValMetGlyValAspValSerLeuGluAspIleLysArgLeuThrPro                               490495500                                                                      CGTTTTACACTGTGCCCCAATGGGTATTACTTTGCAATCGATCCTAAT1588                           ArgPheThrLeuCysProAsnGlyTyrTyrPheAlaIleAspProAsn                               505510515                                                                      GGTTATGTTTTATTACATCCAAATCTTCAGCCAAAGCCTATTGGTGTA1636                           GlyTyrValLeuLeuHisProAsnLeuGlnProLysProIleGlyVal                               520525530                                                                      GGTATACCAACAATTAATTTAAGAAAAAGGAGACCCAATATCCAGAAC1684                           GlyIleProThrIleAsnLeuArgLysArgArgProAsnIleGlnAsn                               535540545550                                                                   CCCAAATCTCAGGAGCCAGTAACATTGGATTTCCTTGATGCAGAGTTA1732                           ProLysSerGlnGluProValThrLeuAspPheLeuAspAlaGluLeu                               555560565                                                                      GAGAATGATATTAAAGTGGAGATTCGAAATAAGATGATTGATGGGGAA1780                           GluAsnAspIleLysValGluIleArgAsnLysMetIleAspGlyGlu                               570575580                                                                      AGTGGAGAAAAAACATTCAGAACTCTGGTTAAATCTCAAGATGAGAGA1828                           SerGlyGluLysThrPheArgThrLeuValLysSerGlnAspGluArg                               585590595                                                                      TATATTGACAAAGGAAACAGGACATACACATGGACACCTGTCAATGGC1876                           TyrIleAspLysGlyAsnArgThrTyrThrTrpThrProValAsnGly                               600605610                                                                      ACAGATTACAGTTTGGCCTTGGTATTACCAACCTACAGTTTTTACTAT1924                           ThrAspTyrSerLeuAlaLeuValLeuProThrTyrSerPheTyrTyr                               615620625630                                                                   ATAAAAGCCAAACTAGAAGAGACAATAACTCAGGCCAGATATTCGGAA1972                           IleLysAlaLysLeuGluGluThrIleThrGlnAlaArgTyrSerGlu                               635640645                                                                      ACCCTGAAGCCAGATAATTTTGAAGAATCTGGCTATACATTCATAGCA2020                           ThrLeuLysProAspAsnPheGluGluSerGlyTyrThrPheIleAla                               650655660                                                                      CCAAGAGATTACTGCAATGACCTGAAAATATCGGATAATAACACTGAA2068                           ProArgAspTyrCysAsnAspLeuLysIleSerAspAsnAsnThrGlu                               665670675                                                                      TTTCTTTTAAATTTCAACGAGTTTATTGATAGAAAAACTCCAAACAAC2116                           PheLeuLeuAsnPheAsnGluPheIleAspArgLysThrProAsnAsn                               680685690                                                                      CCATCATGTAACGCGGATTTGATTAATAGAGTCTTGCTTGATGCAGGC2164                           ProSerCysAsnAlaAspLeuIleAsnArgValLeuLeuAspAlaGly                               695700705710                                                                   TTTACAAATGAACTTGTCCAAAATTACTGGAGTAAGCAGAAAAATATC2212                           PheThrAsnGluLeuValGlnAsnTyrTrpSerLysGlnLysAsnIle                               715720725                                                                      AAGGGAGTGAAAGCACGATTTGTTGTGACTGATGGTGGGATTACCAGA2260                           LysGlyValLysAlaArgPheValValThrAspGlyGlyIleThrArg                               730735740                                                                      GTTTATCCCAAAGAGGCTGGAGAAAATTGGCAAGAAAACCCAGAGACA2308                           ValTyrProLysGluAlaGlyGluAsnTrpGlnGluAsnProGluThr                               745750755                                                                      TATGAGGACAGCTTCTATAAAAGGAGCCTAGATAATGATAACTATGTT2356                           TyrGluAspSerPheTyrLysArgSerLeuAspAsnAspAsnTyrVal                               760765770                                                                      TTCACTGCTCCCTACTTTAACAAAAGTGGACCTGGTGCCTATGAATCG2404                           PheThrAlaProTyrPheAsnLysSerGlyProGlyAlaTyrGluSer                               775780785790                                                                   GGCATTATGGTAAGCAAAGCTGTAGAAATATATATTCAAGGGAAACTT2452                           GlyIleMetValSerLysAlaValGluIleTyrIleGlnGlyLysLeu                               795800805                                                                      CTTAAACCTGCAGTTGTTGGAATTAAAATTGATGTAAATTCCTGGATA2500                           LeuLysProAlaValValGlyIleLysIleAspValAsnSerTrpIle                               810815820                                                                      GAGAATTTCACCAAAACCTCAATCAGAGATCCGTGTGCTGGTCCAGTT2548                           GluAsnPheThrLysThrSerIleArgAspProCysAlaGlyProVal                               825830835                                                                      TGTGACTGCAAAAGAAACAGTGACGTAATGGATTGTGTGATTCTGGAT2596                           CysAspCysLysArgAsnSerAspValMetAspCysValIleLeuAsp                               840845850                                                                      GATGGTGGGTTTCTTCTGATGGCAAATCATGATGATTATACTAATCAG2644                           AspGlyGlyPheLeuLeuMetAlaAsnHisAspAspTyrThrAsnGln                               855860865870                                                                   ATTGGAAGATTTTTTGGAGAGATTGATCCCAGCTTGATGAGACACCTG2692                           IleGlyArgPhePheGlyGluIleAspProSerLeuMetArgHisLeu                               875880885                                                                      GTTAATATATCAGTTTATGCTTTTAACAAATCTTATGATTATCAGTCA2740                           ValAsnIleSerValTyrAlaPheAsnLysSerTyrAspTyrGlnSer                               890895900                                                                      GTATGTGAGCCCGGTGCTGCACCAAAACAAGGAGCAGGACATCGCTCA2788                           ValCysGluProGlyAlaAlaProLysGlnGlyAlaGlyHisArgSer                               905910915                                                                      GCATATGTGCCATCAGTAGCAGACATATTACAAATTGGCTGGTGGGCC2836                           AlaTyrValProSerValAlaAspIleLeuGlnIleGlyTrpTrpAla                               920925930                                                                      ACTGCTGCTGCCTGGTCTATTCTACAGCAGTTTCTCTTGAGTTTGACC2884                           ThrAlaAlaAlaTrpSerIleLeuGlnGlnPheLeuLeuSerLeuThr                               935940945950                                                                   TTTCCACGACTCCTTGAGGCAGTTGAGATGGAGGATGATGACTTCACG2932                           PheProArgLeuLeuGluAlaValGluMetGluAspAspAspPheThr                               955960965                                                                      GCCTCCCTGTCCAAGCAGAGCTGCATTACTGAACAAACCCAGTATTTC2980                           AlaSerLeuSerLysGlnSerCysIleThrGluGlnThrGlnTyrPhe                               970975980                                                                      TTCGATAACGACAGTAAATCATTCAGTGGTGTATTAGACTGTGGAAAC3028                           PheAspAsnAspSerLysSerPheSerGlyValLeuAspCysGlyAsn                               985990995                                                                      TGTTCCAGAATCTTTCATGGAGAAAAGCTTATGAACACCAACTTAATA3076                           CysSerArgIlePheHisGlyGluLysLeuMetAsnThrAsnLeuIle                               100010051010                                                                   TTCATAATGGTTGAGAGCAAAGGGACATGTCCATGTGACACACGACTG3124                           PheIleMetValGluSerLysGlyThrCysProCysAspThrArgLeu                               1015102010251030                                                               CTCATACAAGCGGAGCAGACTTCTGACGGTCCAAATCCTTGTGACATG3172                           LeuIleGlnAlaGluGlnThrSerAspGlyProAsnProCysAspMet                               103510401045                                                                   GTTAAGCAACCTAGATACCGAAAAGGGCCTGATGTCTGCTTTGATAAC3220                           ValLysGlnProArgTyrArgLysGlyProAspValCysPheAspAsn                               105010551060                                                                   AATGTCTTGGAGGATTATACTGACTGTGGTGGTGTTTCTGGATTAAAT3268                           AsnValLeuGluAspTyrThrAspCysGlyGlyValSerGlyLeuAsn                               106510701075                                                                   CCCTCCCTGTGGTATATCATTGGAATCCAGTTTCTACTACTTTGGCTG3316                           ProSerLeuTrpTyrIleIleGlyIleGlnPheLeuLeuLeuTrpLeu                               108010851090                                                                   GTATCTGGCAGCACACACCGGCTGTTATGACCTTCTAAAAACCAAAT3363                            ValSerGlySerThrHisArgLeuLeu                                                    10951100                                                                       CTGCATAGTTAAACTCCAGACCCTGCCAAAACATGAGCCCTGCCCTCAATTACAGTAACG3423               TAGGGTCAGCTATAAAATCAGACAAACATTAGCTGGGCCTGTTCCATGGCATAACACTAA3483               GGCGCAGACTCCTAAGGCACCCACTGGCTGCATGTCAGGGTGTCAGATCCTTAAACGTGT3543               GTGAATGCTGCATCATCTATGTGTAACATCAAAGCAAAATCCTATACGTGTCCTCTATTG3603               GAAAATTTGGGCGTTTGTTGTTGCATTGTTGGT3636                                          (2) INFORMATION FOR SEQ ID NO:23:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 3585 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 35..3295                                                         (D) OTHER INFORMATION: /standard_name="Alpha-2c"                               (ix) FEATURE:                                                                  (A) NAME/KEY: 5'UTR                                                            (B) LOCATION: 1..34                                                            (ix) FEATURE:                                                                  (A) NAME/KEY: 3'UTR                                                            (B) LOCATION: 3296..3585                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:                                       GCGGGGGAGGGGGCATTGATCTTCGATCGCGAAGATGGCTGCTGGCTGCCTG52                         MetAlaAlaGlyCysLeu                                                             15                                                                             CTGGCCTTGACTCTGACACTTTTCCAATCTTTGCTCATCGGCCCCTCG100                            LeuAlaLeuThrLeuThrLeuPheGlnSerLeuLeuIleGlyProSer                               101520                                                                         TCGGAGGAGCCGTTCCCTTCGGCCGTCACTATCAAATCATGGGTGGAT148                            SerGluGluProPheProSerAlaValThrIleLysSerTrpValAsp                               253035                                                                         AAGATGCAAGAAGACCTTGTCACACTGGCAAAAACAGCAAGTGGAGTC196                            LysMetGlnGluAspLeuValThrLeuAlaLysThrAlaSerGlyVal                               404550                                                                         AATCAGCTTGTTGATATTTATGAGAAATATCAAGATTTGTATACTGTG244                            AsnGlnLeuValAspIleTyrGluLysTyrGlnAspLeuTyrThrVal                               55606570                                                                       GAACCAAATAATGCACGCCAGCTGGTAGAAATTGCAGCCAGGGATATT292                            GluProAsnAsnAlaArgGlnLeuValGluIleAlaAlaArgAspIle                               758085                                                                         GAGAAACTTCTGAGCAACAGATCTAAAGCCCTGGTGAGCCTGGCATTG340                            GluLysLeuLeuSerAsnArgSerLysAlaLeuValSerLeuAlaLeu                               9095100                                                                        GAAGCGGAGAAAGTTCAAGCAGCTCACCAGTGGAGAGAAGATTTTGCA388                            GluAlaGluLysValGlnAlaAlaHisGlnTrpArgGluAspPheAla                               105110115                                                                      AGCAATGAAGTTGTCTACTACAATGCAAAGGATGATCTCGATCCTGAG436                            SerAsnGluValValTyrTyrAsnAlaLysAspAspLeuAspProGlu                               120125130                                                                      AAAAATGACAGTGAGCCAGGCAGCCAGAGGATAAAACCTGTTTTCATT484                            LysAsnAspSerGluProGlySerGlnArgIleLysProValPheIle                               135140145150                                                                   GAAGATGCTAATTTTGGACGACAAATATCTTATCAGCACGCAGCAGTC532                            GluAspAlaAsnPheGlyArgGlnIleSerTyrGlnHisAlaAlaVal                               155160165                                                                      CATATTCCTACTGACATCTATGAGGGCTCAACAATTGTGTTAAATGAA580                            HisIleProThrAspIleTyrGluGlySerThrIleValLeuAsnGlu                               170175180                                                                      CTCAACTGGACAAGTGCCTTAGATGAAGTTTTCAAAAAGAATCGCGAG628                            LeuAsnTrpThrSerAlaLeuAspGluValPheLysLysAsnArgGlu                               185190195                                                                      GAAGACCCTTCATTATTGTGGCAGGTTTTTGGCAGTGCCACTGGCCTA676                            GluAspProSerLeuLeuTrpGlnValPheGlySerAlaThrGlyLeu                               200205210                                                                      GCTCGATATTATCCAGCTTCACCATGGGTTGATAATAGTAGAACTCCA724                            AlaArgTyrTyrProAlaSerProTrpValAspAsnSerArgThrPro                               215220225230                                                                   AATAAGATTGACCTTTATGATGTACGCAGAAGACCATGGTACATCCAA772                            AsnLysIleAspLeuTyrAspValArgArgArgProTrpTyrIleGln                               235240245                                                                      GGAGCTGCATCTCCTAAAGACATGCTTATTCTGGTGGATGTGAGTGGA820                            GlyAlaAlaSerProLysAspMetLeuIleLeuValAspValSerGly                               250255260                                                                      AGTGTTAGTGGATTGACACTTAAACTGATCCGAACATCTGTCTCCGAA868                            SerValSerGlyLeuThrLeuLysLeuIleArgThrSerValSerGlu                               265270275                                                                      ATGTTAGAAACCCTCTCAGATGATGATTTCGTGAATGTAGCTTCATTT916                            MetLeuGluThrLeuSerAspAspAspPheValAsnValAlaSerPhe                               280285290                                                                      AACAGCAATGCTCAGGATGTAAGCTGTTTTCAGCACCTTGTCCAAGCA964                            AsnSerAsnAlaGlnAspValSerCysPheGlnHisLeuValGlnAla                               295300305310                                                                   AATGTAAGAAATAAAAAAGTGTTGAAAGACGCGGTGAATAATATCACA1012                           AsnValArgAsnLysLysValLeuLysAspAlaValAsnAsnIleThr                               315320325                                                                      GCCAAAGGAATTACAGATTATAAGAAGGGCTTTAGTTTTGCTTTTGAA1060                           AlaLysGlyIleThrAspTyrLysLysGlyPheSerPheAlaPheGlu                               330335340                                                                      CAGCTGCTTAATTATAATGTTTCCAGAGCAAACTGCAATAAGATTATT1108                           GlnLeuLeuAsnTyrAsnValSerArgAlaAsnCysAsnLysIleIle                               345350355                                                                      ATGCTATTCACGGATGGAGGAGAAGAGAGAGCCCAGGAGATATTTAAC1156                           MetLeuPheThrAspGlyGlyGluGluArgAlaGlnGluIlePheAsn                               360365370                                                                      AAATACAATAAAGATAAAAAAGTACGTGTATTCAGGTTTTCAGTTGGT1204                           LysTyrAsnLysAspLysLysValArgValPheArgPheSerValGly                               375380385390                                                                   CAACACAATTATGAGAGAGGACCTATTCAGTGGATGGCCTGTGAAAAC1252                           GlnHisAsnTyrGluArgGlyProIleGlnTrpMetAlaCysGluAsn                               395400405                                                                      AAAGGTTATTATTATGAAATTCCTTCCATTGGTGCAATAAGAATCAAT1300                           LysGlyTyrTyrTyrGluIleProSerIleGlyAlaIleArgIleAsn                               410415420                                                                      ACTCAGGAATATTTGGATGTTTTGGGAAGACCAATGGTTTTAGCAGGA1348                           ThrGlnGluTyrLeuAspValLeuGlyArgProMetValLeuAlaGly                               425430435                                                                      GACAAAGCTAAGCAAGTCCAATGGACAAATGTGTACCTGGATGCATTG1396                           AspLysAlaLysGlnValGlnTrpThrAsnValTyrLeuAspAlaLeu                               440445450                                                                      GAACTGGGACTTGTCATTACTGGAACTCTTCCGGTCTTCAACATAACC1444                           GluLeuGlyLeuValIleThrGlyThrLeuProValPheAsnIleThr                               455460465470                                                                   GGCCAATTTGAAAATAAGACAAACTTAAAGAACCAGCTGATTCTTGGT1492                           GlyGlnPheGluAsnLysThrAsnLeuLysAsnGlnLeuIleLeuGly                               475480485                                                                      GTGATGGGAGTAGATGTGTCTTTGGAAGATATTAAAAGACTGACACCA1540                           ValMetGlyValAspValSerLeuGluAspIleLysArgLeuThrPro                               490495500                                                                      CGTTTTACACTGTGCCCCAATGGGTATTACTTTGCAATCGATCCTAAT1588                           ArgPheThrLeuCysProAsnGlyTyrTyrPheAlaIleAspProAsn                               505510515                                                                      GGTTATGTTTTATTACATCCAAATCTTCAGCCAAAGGAGCCAGTAACA1636                           GlyTyrValLeuLeuHisProAsnLeuGlnProLysGluProValThr                               520525530                                                                      TTGGATTTCCTTGATGCAGAGTTAGAGAATGATATTAAAGTGGAGATT1684                           LeuAspPheLeuAspAlaGluLeuGluAsnAspIleLysValGluIle                               535540545550                                                                   CGAAATAAGATGATTGATGGGGAAAGTGGAGAAAAAACATTCAGAACT1732                           ArgAsnLysMetIleAspGlyGluSerGlyGluLysThrPheArgThr                               555560565                                                                      CTGGTTAAATCTCAAGATGAGAGATATATTGACAAAGGAAACAGGACA1780                           LeuValLysSerGlnAspGluArgTyrIleAspLysGlyAsnArgThr                               570575580                                                                      TACACATGGACACCTGTCAATGGCACAGATTACAGTTTGGCCTTGGTA1828                           TyrThrTrpThrProValAsnGlyThrAspTyrSerLeuAlaLeuVal                               585590595                                                                      TTACCAACCTACAGTTTTTACTATATAAAAGCCAAACTAGAAGAGACA1876                           LeuProThrTyrSerPheTyrTyrIleLysAlaLysLeuGluGluThr                               600605610                                                                      ATAACTCAGGCCAGATCAAAAAAGGGCAAAATGAAGGATTCGGAAACC1924                           IleThrGlnAlaArgSerLysLysGlyLysMetLysAspSerGluThr                               615620625630                                                                   CTGAAGCCAGATAATTTTGAAGAATCTGGCTATACATTCATAGCACCA1972                           LeuLysProAspAsnPheGluGluSerGlyTyrThrPheIleAlaPro                               635640645                                                                      AGAGATTACTGCAATGACCTGAAAATATCGGATAATAACACTGAATTT2020                           ArgAspTyrCysAsnAspLeuLysIleSerAspAsnAsnThrGluPhe                               650655660                                                                      CTTTTAAATTTCAACGAGTTTATTGATAGAAAAACTCCAAACAACCCA2068                           LeuLeuAsnPheAsnGluPheIleAspArgLysThrProAsnAsnPro                               665670675                                                                      TCATGTAACGCGGATTTGATTAATAGAGTCTTGCTTGATGCAGGCTTT2116                           SerCysAsnAlaAspLeuIleAsnArgValLeuLeuAspAlaGlyPhe                               680685690                                                                      ACAAATGAACTTGTCCAAAATTACTGGAGTAAGCAGAAAAATATCAAG2164                           ThrAsnGluLeuValGlnAsnTyrTrpSerLysGlnLysAsnIleLys                               695700705710                                                                   GGAGTGAAAGCACGATTTGTTGTGACTGATGGTGGGATTACCAGAGTT2212                           GlyValLysAlaArgPheValValThrAspGlyGlyIleThrArgVal                               715720725                                                                      TATCCCAAAGAGGCTGGAGAAAATTGGCAAGAAAACCCAGAGACATAT2260                           TyrProLysGluAlaGlyGluAsnTrpGlnGluAsnProGluThrTyr                               730735740                                                                      GAGGACAGCTTCTATAAAAGGAGCCTAGATAATGATAACTATGTTTTC2308                           GluAspSerPheTyrLysArgSerLeuAspAsnAspAsnTyrValPhe                               745750755                                                                      ACTGCTCCCTACTTTAACAAAAGTGGACCTGGTGCCTATGAATCGGGC2356                           ThrAlaProTyrPheAsnLysSerGlyProGlyAlaTyrGluSerGly                               760765770                                                                      ATTATGGTAAGCAAAGCTGTAGAAATATATATTCAAGGGAAACTTCTT2404                           IleMetValSerLysAlaValGluIleTyrIleGlnGlyLysLeuLeu                               775780785790                                                                   AAACCTGCAGTTGTTGGAATTAAAATTGATGTAAATTCCTGGATAGAG2452                           LysProAlaValValGlyIleLysIleAspValAsnSerTrpIleGlu                               795800805                                                                      AATTTCACCAAAACCTCAATCAGAGATCCGTGTGCTGGTCCAGTTTGT2500                           AsnPheThrLysThrSerIleArgAspProCysAlaGlyProValCys                               810815820                                                                      GACTGCAAAAGAAACAGTGACGTAATGGATTGTGTGATTCTGGATGAT2548                           AspCysLysArgAsnSerAspValMetAspCysValIleLeuAspAsp                               825830835                                                                      GGTGGGTTTCTTCTGATGGCAAATCATGATGATTATACTAATCAGATT2596                           GlyGlyPheLeuLeuMetAlaAsnHisAspAspTyrThrAsnGlnIle                               840845850                                                                      GGAAGATTTTTTGGAGAGATTGATCCCAGCTTGATGAGACACCTGGTT2644                           GlyArgPhePheGlyGluIleAspProSerLeuMetArgHisLeuVal                               855860865870                                                                   AATATATCAGTTTATGCTTTTAACAAATCTTATGATTATCAGTCAGTA2692                           AsnIleSerValTyrAlaPheAsnLysSerTyrAspTyrGlnSerVal                               875880885                                                                      TGTGAGCCCGGTGCTGCACCAAAACAAGGAGCAGGACATCGCTCAGCA2740                           CysGluProGlyAlaAlaProLysGlnGlyAlaGlyHisArgSerAla                               890895900                                                                      TATGTGCCATCAGTAGCAGACATATTACAAATTGGCTGGTGGGCCACT2788                           TyrValProSerValAlaAspIleLeuGlnIleGlyTrpTrpAlaThr                               905910915                                                                      GCTGCTGCCTGGTCTATTCTACAGCAGTTTCTCTTGAGTTTGACCTTT2836                           AlaAlaAlaTrpSerIleLeuGlnGlnPheLeuLeuSerLeuThrPhe                               920925930                                                                      CCACGACTCCTTGAGGCAGTTGAGATGGAGGATGATGACTTCACGGCC2884                           ProArgLeuLeuGluAlaValGluMetGluAspAspAspPheThrAla                               935940945950                                                                   TCCCTGTCCAAGCAGAGCTGCATTACTGAACAAACCCAGTATTTCTTC2932                           SerLeuSerLysGlnSerCysIleThrGluGlnThrGlnTyrPhePhe                               955960965                                                                      GATAACGACAGTAAATCATTCAGTGGTGTATTAGACTGTGGAAACTGT2980                           AspAsnAspSerLysSerPheSerGlyValLeuAspCysGlyAsnCys                               970975980                                                                      TCCAGAATCTTTCATGGAGAAAAGCTTATGAACACCAACTTAATATTC3028                           SerArgIlePheHisGlyGluLysLeuMetAsnThrAsnLeuIlePhe                               985990995                                                                      ATAATGGTTGAGAGCAAAGGGACATGTCCATGTGACACACGACTGCTC3076                           IleMetValGluSerLysGlyThrCysProCysAspThrArgLeuLeu                               100010051010                                                                   ATACAAGCGGAGCAGACTTCTGACGGTCCAAATCCTTGTGACATGGTT3124                           IleGlnAlaGluGlnThrSerAspGlyProAsnProCysAspMetVal                               1015102010251030                                                               AAGCAACCTAGATACCGAAAAGGGCCTGATGTCTGCTTTGATAACAAT3172                           LysGlnProArgTyrArgLysGlyProAspValCysPheAspAsnAsn                               103510401045                                                                   GTCTTGGAGGATTATACTGACTGTGGTGGTGTTTCTGGATTAAATCCC3220                           ValLeuGluAspTyrThrAspCysGlyGlyValSerGlyLeuAsnPro                               105010551060                                                                   TCCCTGTGGTATATCATTGGAATCCAGTTTCTACTACTTTGGCTGGTA3268                           SerLeuTrpTyrIleIleGlyIleGlnPheLeuLeuLeuTrpLeuVal                               106510701075                                                                   TCTGGCAGCACACACCGGCTGTTATGACCTTCTAAAAACCAAATCTGCATAGTT3322                     SerGlySerThrHisArgLeuLeu                                                       10801085                                                                       AAACTCCAGACCCTGCCAAAACATGAGCCCTGCCCTCAATTACAGTAACGTAGGGTCAGC3382               TATAAAATCAGACAAACATTAGCTGGGCCTGTTCCATGGCATAACACTAAGGCGCAGACT3442               CCTAAGGCACCCACTGGCTGCATGTCAGGGTGTCAGATCCTTAAACGTGTGTGAATGCTG3502               CATCATCTATGTGTAACATCAAAGCAAAATCCTATACGTGTCCTCTATTGGAAAATTTGG3562               GCGTTTGTTGTTGCATTGTTGGT3585                                                    (2) INFORMATION FOR SEQ ID NO:24:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 3564 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 35..3374 ( 1625 to 1639 &  1908 to 1928)                         (D) OTHER INFORMATION: /standard_name="Alpha-2d"                               (ix) FEATURE:                                                                  (A) NAME/KEY: 5'UTR                                                            (B) LOCATION: 1..34                                                            (ix) FEATURE:                                                                  (A) NAME/KEY: 3'UTR                                                            (B) LOCATION: 3375..3565                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:                                       GCGGGGGAGGGGGCATTGATCTTCGATCGCGAAGATGGCTGCTGGCTGCCTG52                         MetAlaAlaGlyCysLeu                                                             15                                                                             CTGGCCTTGACTCTGACACTTTTCCAATCTTTGCTCATCGGCCCCTCG100                            LeuAlaLeuThrLeuThrLeuPheGlnSerLeuLeuIleGlyProSer                               101520                                                                         TCGGAGGAGCCGTTCCCTTCGGCCGTCACTATCAAATCATGGGTGGAT148                            SerGluGluProPheProSerAlaValThrIleLysSerTrpValAsp                               253035                                                                         AAGATGCAAGAAGACCTTGTCACACTGGCAAAAACAGCAAGTGGAGTC196                            LysMetGlnGluAspLeuValThrLeuAlaLysThrAlaSerGlyVal                               404550                                                                         AATCAGCTTGTTGATATTTATGAGAAATATCAAGATTTGTATACTGTG244                            AsnGlnLeuValAspIleTyrGluLysTyrGlnAspLeuTyrThrVal                               55606570                                                                       GAACCAAATAATGCACGCCAGCTGGTAGAAATTGCAGCCAGGGATATT292                            GluProAsnAsnAlaArgGlnLeuValGluIleAlaAlaArgAspIle                               758085                                                                         GAGAAACTTCTGAGCAACAGATCTAAAGCCCTGGTGAGCCTGGCATTG340                            GluLysLeuLeuSerAsnArgSerLysAlaLeuValSerLeuAlaLeu                               9095100                                                                        GAAGCGGAGAAAGTTCAAGCAGCTCACCAGTGGAGAGAAGATTTTGCA388                            GluAlaGluLysValGlnAlaAlaHisGlnTrpArgGluAspPheAla                               105110115                                                                      AGCAATGAAGTTGTCTACTACAATGCAAAGGATGATCTCGATCCTGAG436                            SerAsnGluValValTyrTyrAsnAlaLysAspAspLeuAspProGlu                               120125130                                                                      AAAAATGACAGTGAGCCAGGCAGCCAGAGGATAAAACCTGTTTTCATT484                            LysAsnAspSerGluProGlySerGlnArgIleLysProValPheIle                               135140145150                                                                   GAAGATGCTAATTTTGGACGACAAATATCTTATCAGCACGCAGCAGTC532                            GluAspAlaAsnPheGlyArgGlnIleSerTyrGlnHisAlaAlaVal                               155160165                                                                      CATATTCCTACTGACATCTATGAGGGCTCAACAATTGTGTTAAATGAA580                            HisIleProThrAspIleTyrGluGlySerThrIleValLeuAsnGlu                               170175180                                                                      CTCAACTGGACAAGTGCCTTAGATGAAGTTTTCAAAAAGAATCGCGAG628                            LeuAsnTrpThrSerAlaLeuAspGluValPheLysLysAsnArgGlu                               185190195                                                                      GAAGACCCTTCATTATTGTGGCAGGTTTTTGGCAGTGCCACTGGCCTA676                            GluAspProSerLeuLeuTrpGlnValPheGlySerAlaThrGlyLeu                               200205210                                                                      GCTCGATATTATCCAGCTTCACCATGGGTTGATAATAGTAGAACTCCA724                            AlaArgTyrTyrProAlaSerProTrpValAspAsnSerArgThrPro                               215220225230                                                                   AATAAGATTGACCTTTATGATGTACGCAGAAGACCATGGTACATCCAA772                            AsnLysIleAspLeuTyrAspValArgArgArgProTrpTyrIleGln                               235240245                                                                      GGAGCTGCATCTCCTAAAGACATGCTTATTCTGGTGGATGTGAGTGGA820                            GlyAlaAlaSerProLysAspMetLeuIleLeuValAspValSerGly                               250255260                                                                      AGTGTTAGTGGATTGACACTTAAACTGATCCGAACATCTGTCTCCGAA868                            SerValSerGlyLeuThrLeuLysLeuIleArgThrSerValSerGlu                               265270275                                                                      ATGTTAGAAACCCTCTCAGATGATGATTTCGTGAATGTAGCTTCATTT916                            MetLeuGluThrLeuSerAspAspAspPheValAsnValAlaSerPhe                               280285290                                                                      AACAGCAATGCTCAGGATGTAAGCTGTTTTCAGCACCTTGTCCAAGCA964                            AsnSerAsnAlaGlnAspValSerCysPheGlnHisLeuValGlnAla                               295300305310                                                                   AATGTAAGAAATAAAAAAGTGTTGAAAGACGCGGTGAATAATATCACA1012                           AsnValArgAsnLysLysValLeuLysAspAlaValAsnAsnIleThr                               315320325                                                                      GCCAAAGGAATTACAGATTATAAGAAGGGCTTTAGTTTTGCTTTTGAA1060                           AlaLysGlyIleThrAspTyrLysLysGlyPheSerPheAlaPheGlu                               330335340                                                                      CAGCTGCTTAATTATAATGTTTCCAGAGCAAACTGCAATAAGATTATT1108                           GlnLeuLeuAsnTyrAsnValSerArgAlaAsnCysAsnLysIleIle                               345350355                                                                      ATGCTATTCACGGATGGAGGAGAAGAGAGAGCCCAGGAGATATTTAAC1156                           MetLeuPheThrAspGlyGlyGluGluArgAlaGlnGluIlePheAsn                               360365370                                                                      AAATACAATAAAGATAAAAAAGTACGTGTATTCAGGTTTTCAGTTGGT1204                           LysTyrAsnLysAspLysLysValArgValPheArgPheSerValGly                               375380385390                                                                   CAACACAATTATGAGAGAGGACCTATTCAGTGGATGGCCTGTGAAAAC1252                           GlnHisAsnTyrGluArgGlyProIleGlnTrpMetAlaCysGluAsn                               395400405                                                                      AAAGGTTATTATTATGAAATTCCTTCCATTGGTGCAATAAGAATCAAT1300                           LysGlyTyrTyrTyrGluIleProSerIleGlyAlaIleArgIleAsn                               410415420                                                                      ACTCAGGAATATTTGGATGTTTTGGGAAGACCAATGGTTTTAGCAGGA1348                           ThrGlnGluTyrLeuAspValLeuGlyArgProMetValLeuAlaGly                               425430435                                                                      GACAAAGCTAAGCAAGTCCAATGGACAAATGTGTACCTGGATGCATTG1396                           AspLysAlaLysGlnValGlnTrpThrAsnValTyrLeuAspAlaLeu                               440445450                                                                      GAACTGGGACTTGTCATTACTGGAACTCTTCCGGTCTTCAACATAACC1444                           GluLeuGlyLeuValIleThrGlyThrLeuProValPheAsnIleThr                               455460465470                                                                   GGCCAATTTGAAAATAAGACAAACTTAAAGAACCAGCTGATTCTTGGT1492                           GlyGlnPheGluAsnLysThrAsnLeuLysAsnGlnLeuIleLeuGly                               475480485                                                                      GTGATGGGAGTAGATGTGTCTTTGGAAGATATTAAAAGACTGACACCA1540                           ValMetGlyValAspValSerLeuGluAspIleLysArgLeuThrPro                               490495500                                                                      CGTTTTACACTGTGCCCCAATGGGTATTACTTTGCAATCGATCCTAAT1588                           ArgPheThrLeuCysProAsnGlyTyrTyrPheAlaIleAspProAsn                               505510515                                                                      GGTTATGTTTTATTACATCCAAATCTTCAGCCAAAGGAGCCAGTAACA1636                           GlyTyrValLeuLeuHisProAsnLeuGlnProLysGluProValThr                               520525530                                                                      TTGGATTTCCTTGATGCAGAGTTAGAGAATGATATTAAAGTGGAGATT1684                           LeuAspPheLeuAspAlaGluLeuGluAsnAspIleLysValGluIle                               535540545550                                                                   CGAAATAAGATGATTGATGGGGAAAGTGGAGAAAAAACATTCAGAACT1732                           ArgAsnLysMetIleAspGlyGluSerGlyGluLysThrPheArgThr                               555560565                                                                      CTGGTTAAATCTCAAGATGAGAGATATATTGACAAAGGAAACAGGACA1780                           LeuValLysSerGlnAspGluArgTyrIleAspLysGlyAsnArgThr                               570575580                                                                      TACACATGGACACCTGTCAATGGCACAGATTACAGTTTGGCCTTGGTA1828                           TyrThrTrpThrProValAsnGlyThrAspTyrSerLeuAlaLeuVal                               585590595                                                                      TTACCAACCTACAGTTTTTACTATATAAAAGCCAAACTAGAAGAGACA1876                           LeuProThrTyrSerPheTyrTyrIleLysAlaLysLeuGluGluThr                               600605610                                                                      ATAACTCAGGCCAGATATTCGGAAACCCTGAAGCCAGATAATTTTGAA1924                           IleThrGlnAlaArgTyrSerGluThrLeuLysProAspAsnPheGlu                               615620625630                                                                   GAATCTGGCTATACATTCATAGCACCAAGAGATTACTGCAATGACCTG1972                           GluSerGlyTyrThrPheIleAlaProArgAspTyrCysAsnAspLeu                               635640645                                                                      AAAATATCGGATAATAACACTGAATTTCTTTTAAATTTCAACGAGTTT2020                           LysIleSerAspAsnAsnThrGluPheLeuLeuAsnPheAsnGluPhe                               650655660                                                                      ATTGATAGAAAAACTCCAAACAACCCATCATGTAACGCGGATTTGATT2068                           IleAspArgLysThrProAsnAsnProSerCysAsnAlaAspLeuIle                               665670675                                                                      AATAGAGTCTTGCTTGATGCAGGCTTTACAAATGAACTTGTCCAAAAT2116                           AsnArgValLeuLeuAspAlaGlyPheThrAsnGluLeuValGlnAsn                               680685690                                                                      TACTGGAGTAAGCAGAAAAATATCAAGGGAGTGAAAGCACGATTTGTT2164                           TyrTrpSerLysGlnLysAsnIleLysGlyValLysAlaArgPheVal                               695700705710                                                                   GTGACTGATGGTGGGATTACCAGAGTTTATCCCAAAGAGGCTGGAGAA2212                           ValThrAspGlyGlyIleThrArgValTyrProLysGluAlaGlyGlu                               715720725                                                                      AATTGGCAAGAAAACCCAGAGACATATGAGGACAGCTTCTATAAAAGG2260                           AsnTrpGlnGluAsnProGluThrTyrGluAspSerPheTyrLysArg                               730735740                                                                      AGCCTAGATAATGATAACTATGTTTTCACTGCTCCCTACTTTAACAAA2308                           SerLeuAspAsnAspAsnTyrValPheThrAlaProTyrPheAsnLys                               745750755                                                                      AGTGGACCTGGTGCCTATGAATCGGGCATTATGGTAAGCAAAGCTGTA2356                           SerGlyProGlyAlaTyrGluSerGlyIleMetValSerLysAlaVal                               760765770                                                                      GAAATATATATTCAAGGGAAACTTCTTAAACCTGCAGTTGTTGGAATT2404                           GluIleTyrIleGlnGlyLysLeuLeuLysProAlaValValGlyIle                               775780785790                                                                   AAAATTGATGTAAATTCCTGGATAGAGAATTTCACCAAAACCTCAATC2452                           LysIleAspValAsnSerTrpIleGluAsnPheThrLysThrSerIle                               795800805                                                                      AGAGATCCGTGTGCTGGTCCAGTTTGTGACTGCAAAAGAAACAGTGAC2500                           ArgAspProCysAlaGlyProValCysAspCysLysArgAsnSerAsp                               810815820                                                                      GTAATGGATTGTGTGATTCTGGATGATGGTGGGTTTCTTCTGATGGCA2548                           ValMetAspCysValIleLeuAspAspGlyGlyPheLeuLeuMetAla                               825830835                                                                      AATCATGATGATTATACTAATCAGATTGGAAGATTTTTTGGAGAGATT2596                           AsnHisAspAspTyrThrAsnGlnIleGlyArgPhePheGlyGluIle                               840845850                                                                      GATCCCAGCTTGATGAGACACCTGGTTAATATATCAGTTTATGCTTTT2644                           AspProSerLeuMetArgHisLeuValAsnIleSerValTyrAlaPhe                               855860865870                                                                   AACAAATCTTATGATTATCAGTCAGTATGTGAGCCCGGTGCTGCACCA2692                           AsnLysSerTyrAspTyrGlnSerValCysGluProGlyAlaAlaPro                               875880885                                                                      AAACAAGGAGCAGGACATCGCTCAGCATATGTGCCATCAGTAGCAGAC2740                           LysGlnGlyAlaGlyHisArgSerAlaTyrValProSerValAlaAsp                               890895900                                                                      ATATTACAAATTGGCTGGTGGGCCACTGCTGCTGCCTGGTCTATTCTA2788                           IleLeuGlnIleGlyTrpTrpAlaThrAlaAlaAlaTrpSerIleLeu                               905910915                                                                      CAGCAGTTTCTCTTGAGTTTGACCTTTCCACGACTCCTTGAGGCAGTT2836                           GlnGlnPheLeuLeuSerLeuThrPheProArgLeuLeuGluAlaVal                               920925930                                                                      GAGATGGAGGATGATGACTTCACGGCCTCCCTGTCCAAGCAGAGCTGC2884                           GluMetGluAspAspAspPheThrAlaSerLeuSerLysGlnSerCys                               935940945950                                                                   ATTACTGAACAAACCCAGTATTTCTTCGATAACGACAGTAAATCATTC2932                           IleThrGluGlnThrGlnTyrPhePheAspAsnAspSerLysSerPhe                               955960965                                                                      AGTGGTGTATTAGACTGTGGAAACTGTTCCAGAATCTTTCATGGAGAA2980                           SerGlyValLeuAspCysGlyAsnCysSerArgIlePheHisGlyGlu                               970975980                                                                      AAGCTTATGAACACCAACTTAATATTCATAATGGTTGAGAGCAAAGGG3028                           LysLeuMetAsnThrAsnLeuIlePheIleMetValGluSerLysGly                               985990995                                                                      ACATGTCCATGTGACACACGACTGCTCATACAAGCGGAGCAGACTTCT3076                           ThrCysProCysAspThrArgLeuLeuIleGlnAlaGluGlnThrSer                               100010051010                                                                   GACGGTCCAAATCCTTGTGACATGGTTAAGCAACCTAGATACCGAAAA3124                           AspGlyProAsnProCysAspMetValLysGlnProArgTyrArgLys                               1015102010251030                                                               GGGCCTGATGTCTGCTTTGATAACAATGTCTTGGAGGATTATACTGAC3172                           GlyProAspValCysPheAspAsnAsnValLeuGluAspTyrThrAsp                               103510401045                                                                   TGTGGTGGTGTTTCTGGATTAAATCCCTCCCTGTGGTATATCATTGGA3220                           CysGlyGlyValSerGlyLeuAsnProSerLeuTrpTyrIleIleGly                               105010551060                                                                   ATCCAGTTTCTACTACTTTGGCTGGTATCTGGCAGCACACACCGGCTG3268                           IleGlnPheLeuLeuLeuTrpLeuValSerGlySerThrHisArgLeu                               106510701075                                                                   TTATGACCTTCTAAAAACCAAATCTGCATAGTTAAACTCCAGACCCTGCCAAA3321                      Leu                                                                            ACATGAGCCCTGCCCTCAATTACAGTAACGTAGGGTCAGCTATAAAATCAGACAAACATT3381               AGCTGGGCCTGTTCCATGGCATAACACTAAGGCGCAGACTCCTAAGGCACCCACTGGCTG3441               CATGTCAGGGTGTCAGATCCTTAAACGTGTGTGAATGCTGCATCATCTATGTGTAACATC3501               AAAGCAAAATCCTATACGTGTCCTCTATTGGAAAATTTGGGCGTTTGTTGTTGCATTGTT3561               GGT3564                                                                        (2) INFORMATION FOR SEQ ID NO:25:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 3579 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 35..3289                                                         (D) OTHER INFORMATION: /standard_name="Alpha-2e"                               (ix) FEATURE:                                                                  (A) NAME/KEY: 5'UTR                                                            (B) LOCATION: 1..34                                                            (ix) FEATURE:                                                                  (A) NAME/KEY: 3'UTR                                                            (B) LOCATION: 3289..3579                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:                                       GCGGGGGAGGGGGCATTGATCTTCGATCGCGAAGATGGCTGCTGGCTGCCTG52                         MetAlaAlaGlyCysLeu                                                             15                                                                             CTGGCCTTGACTCTGACACTTTTCCAATCTTTGCTCATCGGCCCCTCG100                            LeuAlaLeuThrLeuThrLeuPheGlnSerLeuLeuIleGlyProSer                               101520                                                                         TCGGAGGAGCCGTTCCCTTCGGCCGTCACTATCAAATCATGGGTGGAT148                            SerGluGluProPheProSerAlaValThrIleLysSerTrpValAsp                               253035                                                                         AAGATGCAAGAAGACCTTGTCACACTGGCAAAAACAGCAAGTGGAGTC196                            LysMetGlnGluAspLeuValThrLeuAlaLysThrAlaSerGlyVal                               404550                                                                         AATCAGCTTGTTGATATTTATGAGAAATATCAAGATTTGTATACTGTG244                            AsnGlnLeuValAspIleTyrGluLysTyrGlnAspLeuTyrThrVal                               55606570                                                                       GAACCAAATAATGCACGCCAGCTGGTAGAAATTGCAGCCAGGGATATT292                            GluProAsnAsnAlaArgGlnLeuValGluIleAlaAlaArgAspIle                               758085                                                                         GAGAAACTTCTGAGCAACAGATCTAAAGCCCTGGTGAGCCTGGCATTG340                            GluLysLeuLeuSerAsnArgSerLysAlaLeuValSerLeuAlaLeu                               9095100                                                                        GAAGCGGAGAAAGTTCAAGCAGCTCACCAGTGGAGAGAAGATTTTGCA388                            GluAlaGluLysValGlnAlaAlaHisGlnTrpArgGluAspPheAla                               105110115                                                                      AGCAATGAAGTTGTCTACTACAATGCAAAGGATGATCTCGATCCTGAG436                            SerAsnGluValValTyrTyrAsnAlaLysAspAspLeuAspProGlu                               120125130                                                                      AAAAATGACAGTGAGCCAGGCAGCCAGAGGATAAAACCTGTTTTCATT484                            LysAsnAspSerGluProGlySerGlnArgIleLysProValPheIle                               135140145150                                                                   GAAGATGCTAATTTTGGACGACAAATATCTTATCAGCACGCAGCAGTC532                            GluAspAlaAsnPheGlyArgGlnIleSerTyrGlnHisAlaAlaVal                               155160165                                                                      CATATTCCTACTGACATCTATGAGGGCTCAACAATTGTGTTAAATGAA580                            HisIleProThrAspIleTyrGluGlySerThrIleValLeuAsnGlu                               170175180                                                                      CTCAACTGGACAAGTGCCTTAGATGAAGTTTTCAAAAAGAATCGCGAG628                            LeuAsnTrpThrSerAlaLeuAspGluValPheLysLysAsnArgGlu                               185190195                                                                      GAAGACCCTTCATTATTGTGGCAGGTTTTTGGCAGTGCCACTGGCCTA676                            GluAspProSerLeuLeuTrpGlnValPheGlySerAlaThrGlyLeu                               200205210                                                                      GCTCGATATTATCCAGCTTCACCATGGGTTGATAATAGTAGAACTCCA724                            AlaArgTyrTyrProAlaSerProTrpValAspAsnSerArgThrPro                               215220225230                                                                   AATAAGATTGACCTTTATGATGTACGCAGAAGACCATGGTACATCCAA772                            AsnLysIleAspLeuTyrAspValArgArgArgProTrpTyrIleGln                               235240245                                                                      GGAGCTGCATCTCCTAAAGACATGCTTATTCTGGTGGATGTGAGTGGA820                            GlyAlaAlaSerProLysAspMetLeuIleLeuValAspValSerGly                               250255260                                                                      AGTGTTAGTGGATTGACACTTAAACTGATCCGAACATCTGTCTCCGAA868                            SerValSerGlyLeuThrLeuLysLeuIleArgThrSerValSerGlu                               265270275                                                                      ATGTTAGAAACCCTCTCAGATGATGATTTCGTGAATGTAGCTTCATTT916                            MetLeuGluThrLeuSerAspAspAspPheValAsnValAlaSerPhe                               280285290                                                                      AACAGCAATGCTCAGGATGTAAGCTGTTTTCAGCACCTTGTCCAAGCA964                            AsnSerAsnAlaGlnAspValSerCysPheGlnHisLeuValGlnAla                               295300305310                                                                   AATGTAAGAAATAAAAAAGTGTTGAAAGACGCGGTGAATAATATCACA1012                           AsnValArgAsnLysLysValLeuLysAspAlaValAsnAsnIleThr                               315320325                                                                      GCCAAAGGAATTACAGATTATAAGAAGGGCTTTAGTTTTGCTTTTGAA1060                           AlaLysGlyIleThrAspTyrLysLysGlyPheSerPheAlaPheGlu                               330335340                                                                      CAGCTGCTTAATTATAATGTTTCCAGAGCAAACTGCAATAAGATTATT1108                           GlnLeuLeuAsnTyrAsnValSerArgAlaAsnCysAsnLysIleIle                               345350355                                                                      ATGCTATTCACGGATGGAGGAGAAGAGAGAGCCCAGGAGATATTTAAC1156                           MetLeuPheThrAspGlyGlyGluGluArgAlaGlnGluIlePheAsn                               360365370                                                                      AAATACAATAAAGATAAAAAAGTACGTGTATTCAGGTTTTCAGTTGGT1204                           LysTyrAsnLysAspLysLysValArgValPheArgPheSerValGly                               375380385390                                                                   CAACACAATTATGAGAGAGGACCTATTCAGTGGATGGCCTGTGAAAAC1252                           GlnHisAsnTyrGluArgGlyProIleGlnTrpMetAlaCysGluAsn                               395400405                                                                      AAAGGTTATTATTATGAAATTCCTTCCATTGGTGCAATAAGAATCAAT1300                           LysGlyTyrTyrTyrGluIleProSerIleGlyAlaIleArgIleAsn                               410415420                                                                      ACTCAGGAATATTTGGATGTTTTGGGAAGACCAATGGTTTTAGCAGGA1348                           ThrGlnGluTyrLeuAspValLeuGlyArgProMetValLeuAlaGly                               425430435                                                                      GACAAAGCTAAGCAAGTCCAATGGACAAATGTGTACCTGGATGCATTG1396                           AspLysAlaLysGlnValGlnTrpThrAsnValTyrLeuAspAlaLeu                               440445450                                                                      GAACTGGGACTTGTCATTACTGGAACTCTTCCGGTCTTCAACATAACC1444                           GluLeuGlyLeuValIleThrGlyThrLeuProValPheAsnIleThr                               455460465470                                                                   GGCCAATTTGAAAATAAGACAAACTTAAAGAACCAGCTGATTCTTGGT1492                           GlyGlnPheGluAsnLysThrAsnLeuLysAsnGlnLeuIleLeuGly                               475480485                                                                      GTGATGGGAGTAGATGTGTCTTTGGAAGATATTAAAAGACTGACACCA1540                           ValMetGlyValAspValSerLeuGluAspIleLysArgLeuThrPro                               490495500                                                                      CGTTTTACACTGTGCCCCAATGGGTATTACTTTGCAATCGATCCTAAT1588                           ArgPheThrLeuCysProAsnGlyTyrTyrPheAlaIleAspProAsn                               505510515                                                                      GGTTATGTTTTATTACATCCAAATCTTCAGCCAAAGAACCCCAAATCT1636                           GlyTyrValLeuLeuHisProAsnLeuGlnProLysAsnProLysSer                               520525530                                                                      CAGGAGCCAGTAACATTGGATTTCCTTGATGCAGAGTTAGAGAATGAT1684                           GlnGluProValThrLeuAspPheLeuAspAlaGluLeuGluAsnAsp                               535540545550                                                                   ATTAAAGTGGAGATTCGAAATAAGATGATTGATGGGGAAAGTGGAGAA1732                           IleLysValGluIleArgAsnLysMetIleAspGlyGluSerGlyGlu                               555560565                                                                      AAAACATTCAGAACTCTGGTTAAATCTCAAGATGAGAGATATATTGAC1780                           LysThrPheArgThrLeuValLysSerGlnAspGluArgTyrIleAsp                               570575580                                                                      AAAGGAAACAGGACATACACATGGACACCTGTCAATGGCACAGATTAC1828                           LysGlyAsnArgThrTyrThrTrpThrProValAsnGlyThrAspTyr                               585590595                                                                      AGTTTGGCCTTGGTATTACCAACCTACAGTTTTTACTATATAAAAGCC1876                           SerLeuAlaLeuValLeuProThrTyrSerPheTyrTyrIleLysAla                               600605610                                                                      AAACTAGAAGAGACAATAACTCAGGCCAGATATTCGGAAACCCTGAAG1924                           LysLeuGluGluThrIleThrGlnAlaArgTyrSerGluThrLeuLys                               615620625630                                                                   CCAGATAATTTTGAAGAATCTGGCTATACATTCATAGCACCAAGAGAT1972                           ProAspAsnPheGluGluSerGlyTyrThrPheIleAlaProArgAsp                               635640645                                                                      TACTGCAATGACCTGAAAATATCGGATAATAACACTGAATTTCTTTTA2020                           TyrCysAsnAspLeuLysIleSerAspAsnAsnThrGluPheLeuLeu                               650655660                                                                      AATTTCAACGAGTTTATTGATAGAAAAACTCCAAACAACCCATCATGT2068                           AsnPheAsnGluPheIleAspArgLysThrProAsnAsnProSerCys                               665670675                                                                      AACGCGGATTTGATTAATAGAGTCTTGCTTGATGCAGGCTTTACAAAT2116                           AsnAlaAspLeuIleAsnArgValLeuLeuAspAlaGlyPheThrAsn                               680685690                                                                      GAACTTGTCCAAAATTACTGGAGTAAGCAGAAAAATATCAAGGGAGTG2164                           GluLeuValGlnAsnTyrTrpSerLysGlnLysAsnIleLysGlyVal                               695700705710                                                                   AAAGCACGATTTGTTGTGACTGATGGTGGGATTACCAGAGTTTATCCC2212                           LysAlaArgPheValValThrAspGlyGlyIleThrArgValTyrPro                               715720725                                                                      AAAGAGGCTGGAGAAAATTGGCAAGAAAACCCAGAGACATATGAGGAC2260                           LysGluAlaGlyGluAsnTrpGlnGluAsnProGluThrTyrGluAsp                               730735740                                                                      AGCTTCTATAAAAGGAGCCTAGATAATGATAACTATGTTTTCACTGCT2308                           SerPheTyrLysArgSerLeuAspAsnAspAsnTyrValPheThrAla                               745750755                                                                      CCCTACTTTAACAAAAGTGGACCTGGTGCCTATGAATCGGGCATTATG2356                           ProTyrPheAsnLysSerGlyProGlyAlaTyrGluSerGlyIleMet                               760765770                                                                      GTAAGCAAAGCTGTAGAAATATATATTCAAGGGAAACTTCTTAAACCT2404                           ValSerLysAlaValGluIleTyrIleGlnGlyLysLeuLeuLysPro                               775780785790                                                                   GCAGTTGTTGGAATTAAAATTGATGTAAATTCCTGGATAGAGAATTTC2452                           AlaValValGlyIleLysIleAspValAsnSerTrpIleGluAsnPhe                               795800805                                                                      ACCAAAACCTCAATCAGAGATCCGTGTGCTGGTCCAGTTTGTGACTGC2500                           ThrLysThrSerIleArgAspProCysAlaGlyProValCysAspCys                               810815820                                                                      AAAAGAAACAGTGACGTAATGGATTGTGTGATTCTGGATGATGGTGGG2548                           LysArgAsnSerAspValMetAspCysValIleLeuAspAspGlyGly                               825830835                                                                      TTTCTTCTGATGGCAAATCATGATGATTATACTAATCAGATTGGAAGA2596                           PheLeuLeuMetAlaAsnHisAspAspTyrThrAsnGlnIleGlyArg                               840845850                                                                      TTTTTTGGAGAGATTGATCCCAGCTTGATGAGACACCTGGTTAATATA2644                           PhePheGlyGluIleAspProSerLeuMetArgHisLeuValAsnIle                               855860865870                                                                   TCAGTTTATGCTTTTAACAAATCTTATGATTATCAGTCAGTATGTGAG2692                           SerValTyrAlaPheAsnLysSerTyrAspTyrGlnSerValCysGlu                               875880885                                                                      CCCGGTGCTGCACCAAAACAAGGAGCAGGACATCGCTCAGCATATGTG2740                           ProGlyAlaAlaProLysGlnGlyAlaGlyHisArgSerAlaTyrVal                               890895900                                                                      CCATCAGTAGCAGACATATTACAAATTGGCTGGTGGGCCACTGCTGCT2788                           ProSerValAlaAspIleLeuGlnIleGlyTrpTrpAlaThrAlaAla                               905910915                                                                      GCCTGGTCTATTCTACAGCAGTTTCTCTTGAGTTTGACCTTTCCACGA2836                           AlaTrpSerIleLeuGlnGlnPheLeuLeuSerLeuThrPheProArg                               920925930                                                                      CTCCTTGAGGCAGTTGAGATGGAGGATGATGACTTCACGGCCTCCCTG2884                           LeuLeuGluAlaValGluMetGluAspAspAspPheThrAlaSerLeu                               935940945950                                                                   TCCAAGCAGAGCTGCATTACTGAACAAACCCAGTATTTCTTCGATAAC2932                           SerLysGlnSerCysIleThrGluGlnThrGlnTyrPhePheAspAsn                               955960965                                                                      GACAGTAAATCATTCAGTGGTGTATTAGACTGTGGAAACTGTTCCAGA2980                           AspSerLysSerPheSerGlyValLeuAspCysGlyAsnCysSerArg                               970975980                                                                      ATCTTTCATGGAGAAAAGCTTATGAACACCAACTTAATATTCATAATG3028                           IlePheHisGlyGluLysLeuMetAsnThrAsnLeuIlePheIleMet                               985990995                                                                      GTTGAGAGCAAAGGGACATGTCCATGTGACACACGACTGCTCATACAA3076                           ValGluSerLysGlyThrCysProCysAspThrArgLeuLeuIleGln                               100010051010                                                                   GCGGAGCAGACTTCTGACGGTCCAAATCCTTGTGACATGGTTAAGCAA3124                           AlaGluGlnThrSerAspGlyProAsnProCysAspMetValLysGln                               1015102010251030                                                               CCTAGATACCGAAAAGGGCCTGATGTCTGCTTTGATAACAATGTCTTG3172                           ProArgTyrArgLysGlyProAspValCysPheAspAsnAsnValLeu                               103510401045                                                                   GAGGATTATACTGACTGTGGTGGTGTTTCTGGATTAAATCCCTCCCTG3220                           GluAspTyrThrAspCysGlyGlyValSerGlyLeuAsnProSerLeu                               105010551060                                                                   TGGTATATCATTGGAATCCAGTTTCTACTACTTTGGCTGGTATCTGGC3268                           TrpTyrIleIleGlyIleGlnPheLeuLeuLeuTrpLeuValSerGly                               106510701075                                                                   AGCACACACCGGCTGTTATGACCTTCTAAAAACCAAATCTGCATAGTT3316                           SerThrHisArgLeuLeu                                                             1080108                                                                        AAACTCCAGACCCTGCCAAAACATGAGCCCTGCCCTCAATTACAGTAACGTAGGGTCAGC3376               TATAAAATCAGACAAACATTAGCTGGGCCTGTTCCATGGCATAACACTAAGGCGCAGACT3436               CCTAAGGCACCCACTGGCTGCATGTCAGGGTGTCAGATCCTTAAACGTGTGTGAATGCTG3496               CATCATCTATGTGTAACATCAAAGCAAAATCCTATACGTGTCCTCTATTGGAAAATTTGG3556               GCGTTTGTTGTTGCATTGTTGGT3579                                                    (2) INFORMATION FOR SEQ ID NO:26:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 1681 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 1..1437                                                          (D) OTHER INFORMATION: /standard_name="Beta-1-1"                               (ix) FEATURE:                                                                  (A) NAME/KEY: 3'UTR                                                            (B) LOCATION: 1435..1681                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:                                       ATGGTCCAGAAGACCAGCATGTCCCGGGGCCCTTACCCACCCTCCCAG48                             MetValGlnLysThrSerMetSerArgGlyProTyrProProSerGln                               151015                                                                         GAGATCCCCATGGAGGTCTTCGACCCCAGCCCGCAGGGCAAATACAGC96                             GluIleProMetGluValPheAspProSerProGlnGlyLysTyrSer                               202530                                                                         AAGAGGAAAGGGCGATTCAAACGGTCAGATGGGAGCACGTCCTCGGAT144                            LysArgLysGlyArgPheLysArgSerAspGlySerThrSerSerAsp                               354045                                                                         ACCACATCCAACAGCTTTGTCCGCCAGGGCTCAGCGGAGTCCTACACC192                            ThrThrSerAsnSerPheValArgGlnGlySerAlaGluSerTyrThr                               505560                                                                         AGCCGTCCATCAGACTCTGATGTATCTCTGGAGGAGGACCGGGAAGCC240                            SerArgProSerAspSerAspValSerLeuGluGluAspArgGluAla                               65707580                                                                       TTAAGGAAGGAAGCAGAGCGCCAGGCATTAGCGCAGCTCGAGAAGGCC288                            LeuArgLysGluAlaGluArgGlnAlaLeuAlaGlnLeuGluLysAla                               859095                                                                         AAGACCAAGCCAGTGGCATTTGCTGTGCGGACAAATGTTGGCTACAAT336                            LysThrLysProValAlaPheAlaValArgThrAsnValGlyTyrAsn                               100105110                                                                      CCGTCTCCAGGGGATGAGGTGCCTGTGCAGGGAGTGGCCATCACCTTC384                            ProSerProGlyAspGluValProValGlnGlyValAlaIleThrPhe                               115120125                                                                      GAGCCCAAAGACTTCCTGCACATCAAGGAGAAATACAATAATGACTGG432                            GluProLysAspPheLeuHisIleLysGluLysTyrAsnAsnAspTrp                               130135140                                                                      TGGATCGGGCGGCTGGTGAAGGAGGGCTGTGAGGTTGGCTTCATTCCC480                            TrpIleGlyArgLeuValLysGluGlyCysGluValGlyPheIlePro                               145150155160                                                                   AGCCCCGTCAAACTGGACAGCCTTCGCCTGCTGCAGGAACAGAAGCTG528                            SerProValLysLeuAspSerLeuArgLeuLeuGlnGluGlnLysLeu                               165170175                                                                      CGCCAGAACCGCCTCGGCTCCAGCAAATCAGGCGATAACTCCAGTTCC576                            ArgGlnAsnArgLeuGlySerSerLysSerGlyAspAsnSerSerSer                               180185190                                                                      AGTCTGGGAGATGTGGTGACTGGCACCCGCCGCCCCACACCCCCTGCC624                            SerLeuGlyAspValValThrGlyThrArgArgProThrProProAla                               195200205                                                                      AGTGGTAATGAAATGACTAACTTAGCCTTTGAACTAGACCCCCTAGAG672                            SerGlyAsnGluMetThrAsnLeuAlaPheGluLeuAspProLeuGlu                               210215220                                                                      TTAGAGGAGGAAGAGGCTGAGCTTGGTGAGCAGAGTGGCTCTGCCAAG720                            LeuGluGluGluGluAlaGluLeuGlyGluGlnSerGlySerAlaLys                               225230235240                                                                   ACTAGTGTTAGCAGTGTCACCACCCCGCCACCCCATGGCAAACGCATC768                            ThrSerValSerSerValThrThrProProProHisGlyLysArgIle                               245250255                                                                      CCCTTCTTTAAGAAGACAGAGCATGTGCCCCCCTATGACGTGGTGCCT816                            ProPhePheLysLysThrGluHisValProProTyrAspValValPro                               260265270                                                                      TCCATGAGGCCCATCATCCTGGTGGGACCGTCGCTCAAGGGCTACGAG864                            SerMetArgProIleIleLeuValGlyProSerLeuLysGlyTyrGlu                               275280285                                                                      GTTACAGACATGATGCAGAAAGCTTTATTTGACTTCTTGAAGCATCGG912                            ValThrAspMetMetGlnLysAlaLeuPheAspPheLeuLysHisArg                               290295300                                                                      TTTGATGGCAGGATCTCCATCACTCGTGTGACGGCAGATATTTCCCTG960                            PheAspGlyArgIleSerIleThrArgValThrAlaAspIleSerLeu                               305310315320                                                                   GCTAAGCGCTCAGTTCTCAACAACCCCAGCAAACACATCATCATTGAG1008                           AlaLysArgSerValLeuAsnAsnProSerLysHisIleIleIleGlu                               325330335                                                                      CGCTCCAACACACGCTCCAGCCTGGCTGAGGTGCAGAGTGAAATCGAG1056                           ArgSerAsnThrArgSerSerLeuAlaGluValGlnSerGluIleGlu                               340345350                                                                      CGAATCTTCGAGCTGGCCCGGACCCTTCAGTTGGTCGCTCTGGATGCT1104                           ArgIlePheGluLeuAlaArgThrLeuGlnLeuValAlaLeuAspAla                               355360365                                                                      GACACCATCAATCACCCAGCCCAGCTGTCCAAGACCTCGCTGGCCCCC1152                           AspThrIleAsnHisProAlaGlnLeuSerLysThrSerLeuAlaPro                               370375380                                                                      ATCATTGTTTACATCAAGATCACCTCTCCCAAGGTACTTCAAAGGCTC1200                           IleIleValTyrIleLysIleThrSerProLysValLeuGlnArgLeu                               385390395400                                                                   ATCAAGTCCCGAGGAAAGTCTCAGTCCAAACACCTCAATGTCCAAATA1248                           IleLysSerArgGlyLysSerGlnSerLysHisLeuAsnValGlnIle                               405410415                                                                      GCGGCCTCGGAAAAGCTGGCACAGTGCCCCCCTGAAATGTTTGACATC1296                           AlaAlaSerGluLysLeuAlaGlnCysProProGluMetPheAspIle                               420425430                                                                      ATCCTGGATGAGAACCAATTGGAGGATGCCTGCGAGCATCTGGCGGAG1344                           IleLeuAspGluAsnGlnLeuGluAspAlaCysGluHisLeuAlaGlu                               435440445                                                                      TACTTGGAAGCCTATTGGAAGGCCACACACCCGCCCAGCAGCACGCCA1392                           TyrLeuGluAlaTyrTrpLysAlaThrHisProProSerSerThrPro                               450455460                                                                      CCCAATCCGCTGCTGAACCGCACCATGGCTACCGCAGCCCTGGCT1437                              ProAsnProLeuLeuAsnArgThrMetAlaThrAlaAlaLeuAla                                  465470475                                                                      GCCAGCCCTGCCCCTGTCTCCAACCTCCAGGTACAGGTGCTCACCTCGCTCAGGAGAAAC1497               CTCGGCTTCTGGGGCGGGCTGGAGTCCTCACAGCGGGGCAGTGTGGTGCCCCAGGAGCAG1557               GAACATGCCATGTAGTGGGCGCCCTGCCCGTCTTCCCTCCTGCTCTGGGGTCGGAACTGG1617               AGTGCAGGGAACATGGAGGAGGAAGGGAAGAGCTTTATTTTGTAAAAAAATAAGATGAGC1677               GGCA1681                                                                       (2) INFORMATION FOR SEQ ID NO:27:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 1526 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 1..651                                                           (D) OTHER INFORMATION: /standard_name="Beta 1-4"                               (xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:                                       ATGGTCCAGAAGACCAGCATGTCCCGGGGCCCTTACCCACCCTCCCAG48                             MetValGlnLysThrSerMetSerArgGlyProTyrProProSerGln                               151015                                                                         GAGATCCCCATGGAGGTCTTCGACCCCAGCCCGCAGGGCAAATACAGC96                             GluIleProMetGluValPheAspProSerProGlnGlyLysTyrSer                               202530                                                                         AAGAGGAAAGGGCGATTCAAACGGTCAGATGGGAGCACGTCCTCGGAT144                            LysArgLysGlyArgPheLysArgSerAspGlySerThrSerSerAsp                               354045                                                                         ACCACATCCAACAGCTTTGTCCGCCAGGGCTCAGCGGAGTCCTACACC192                            ThrThrSerAsnSerPheValArgGlnGlySerAlaGluSerTyrThr                               505560                                                                         AGCCGTCCATCAGACTCTGATGTATCTCTGGAGGAGGACCGGGAAGCC240                            SerArgProSerAspSerAspValSerLeuGluGluAspArgGluAla                               65707580                                                                       TTAAGGAAGGAAGCAGAGCGCCAGGCATTAGCGCAGCTCGAGAAGGCC288                            LeuArgLysGluAlaGluArgGlnAlaLeuAlaGlnLeuGluLysAla                               859095                                                                         AAGACCAAGCCAGTGGCATTTGCTGTGCGGACAAATGTTGGCTACAAT336                            LysThrLysProValAlaPheAlaValArgThrAsnValGlyTyrAsn                               100105110                                                                      CCGTCTCCAGGGGATGAGGTGCCTGTGCAGGGAGTGGCCATCACCTTC384                            ProSerProGlyAspGluValProValGlnGlyValAlaIleThrPhe                               115120125                                                                      GAGCCCAAAGACTTCCTGCACATCAAGGAGAAATACAATAATGACTGG432                            GluProLysAspPheLeuHisIleLysGluLysTyrAsnAsnAspTrp                               130135140                                                                      TGGATCGGGCGGCTGGTGAAGGAGGGCTGTGAGGTTGGCTTCATTCCC480                            TrpIleGlyArgLeuValLysGluGlyCysGluValGlyPheIlePro                               145150155160                                                                   AGCCCCGTCAAACTGGACAGCCTTCGCCTGCTGCAGGAACAGAAGCTG528                            SerProValLysLeuAspSerLeuArgLeuLeuGlnGluGlnLysLeu                               165170175                                                                      CGCCAGAACCGCCTCGGCTCCAGCAAATCAGGCGATAACTCCAGTTCC576                            ArgGlnAsnArgLeuGlySerSerLysSerGlyAspAsnSerSerSer                               180185190                                                                      AGTCTGGGAGATGTGGTGACTGGCACCCGCCGCCCCACACCCCCTGCC624                            SerLeuGlyAspValValThrGlyThrArgArgProThrProProAla                               195200205                                                                      AGTGACAGAGCATGTGCCCCCCTATGACGTGGTGCCTTCCATGAGGCCCATCAT678                      SerAspArgAlaCysAlaProLeu                                                       210215                                                                         CCTGGTGGGACCGTCGCTCAAGGGCTACGAGGTTACAGACATGATGCAGAAAGCTTTATT738                TGACTTCTTGAAGCATCGGTTTGATGGCAGGATCTCCATCACTCGTGTGACGGCAGATAT798                TTCCCTGGCTAAGCGCTCAGTTCTCAACAACCCCAGCAAACACATCATCATTGAGCGCTC858                CAACACACGCTCCAGCCTGGCTGAGGTGCAGAGTGAAATCGAGCGAATCTTCGAGCTGGC918                CCGGACCCTTCAGTTGGTCGCTCTGGATGCTGACACCATCAATCACCCAGCCCAGCTGTC978                CAAGACCTCGCTGGCCCCCATCATTGTTTACATCAAGATCACCTCTCCCAAGGTACTTCA1038               AAGGCTCATCAAGTCCCGAGGAAAGTCTCAGTCCAAACACCTCAATGTCCAAATAGCGGC1098               CTCGGAAAAGCTGGCACAGTGCCCCCCTGAAATGTTTGACATCATCCTGGATGAGAACCA1158               ATTGGAGGATGCCTGCGAGCATCTGGCGGAGTACTTGGAAGCCTATTGGAAGGCCACACA1218               CCCGCCCAGCAGCACGCCACCCAATCCGCTGCTGAACCGCACCATGGCTACCGCAGCCCT1278               GGCTGCCAGCCCTGCCCCTGTCTCCAACCTCCAGGTACAGGTGCTCACCTCGCTCAGGAG1338               AAACCTCGGCTTCTGGGGCGGGCTGGAGTCCTCACAGCGGGGCAGTGTGGTGCCCCAGGA1398               GCAGGAACATGCCATGTAGTGGGCGCCCTGCCCGTCTTCCCTCCTGCTCTGGGGTCGGAA1458               CTGGAGTGCAGGGAACATGGAGGAGGAAGGGAAGAGCTTTATTTTGTAAAAAAATAAGAT1518               GAGCGGCA1526                                                                   (2) INFORMATION FOR SEQ ID NO:28:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 1393 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 1..660                                                           (D) OTHER INFORMATION: /standard_name="Beta-1-5"                               (xi) SEQUENCE DESCRIPTION: SEQ ID NO:28:                                       ATGGTCCAGAAGACCAGCATGTCCCGGGGCCCTTACCCACCCTCCCAG48                             MetValGlnLysThrSerMetSerArgGlyProTyrProProSerGln                               151015                                                                         GAGATCCCCATGGAGGTCTTCGACCCCAGCCCGCAGGGCAAATACAGC96                             GluIleProMetGluValPheAspProSerProGlnGlyLysTyrSer                               202530                                                                         AAGAGGAAAGGGCGATTCAAACGGTCAGATGGGAGCACGTCCTCGGAT144                            LysArgLysGlyArgPheLysArgSerAspGlySerThrSerSerAsp                               354045                                                                         ACCACATCCAACAGCTTTGTCCGCCAGGGCTCAGCGGAGTCCTACACC192                            ThrThrSerAsnSerPheValArgGlnGlySerAlaGluSerTyrThr                               505560                                                                         AGCCGTCCATCAGACTCTGATGTATCTCTGGAGGAGGACCGGGAAGCC240                            SerArgProSerAspSerAspValSerLeuGluGluAspArgGluAla                               65707580                                                                       TTAAGGAAGGAAGCAGAGCGCCAGGCATTAGCGCAGCTCGAGAAGGCC288                            LeuArgLysGluAlaGluArgGlnAlaLeuAlaGlnLeuGluLysAla                               859095                                                                         AAGACCAAGCCAGTGGCATTTGCTGTGCGGACAAATGTTGGCTACAAT336                            LysThrLysProValAlaPheAlaValArgThrAsnValGlyTyrAsn                               100105110                                                                      CCGTCTCCAGGGGATGAGGTGCCTGTGCAGGGAGTGGCCATCACCTTC384                            ProSerProGlyAspGluValProValGlnGlyValAlaIleThrPhe                               115120125                                                                      GAGCCCAAAGACTTCCTGCACATCAAGGAGAAATACAATAATGACTGG432                            GluProLysAspPheLeuHisIleLysGluLysTyrAsnAsnAspTrp                               130135140                                                                      TGGATCGGGCGGCTGGTGAAGGAGGGCTGTGAGGTTGGCTTCATTCCC480                            TrpIleGlyArgLeuValLysGluGlyCysGluValGlyPheIlePro                               145150155160                                                                   AGCCCCGTCAAACTGGACAGCCTTCGCCTGCTGCAGGAACAGAAGCTG528                            SerProValLysLeuAspSerLeuArgLeuLeuGlnGluGlnLysLeu                               165170175                                                                      CGCCAGAACCGCCTCGGCTCCAGCAAATCAGGCGATAACTCCAGTTCC576                            ArgGlnAsnArgLeuGlySerSerLysSerGlyAspAsnSerSerSer                               180185190                                                                      AGTCTGGGAGATGTGGTGACTGGCACCCGCCGCCCCACACCCCCTGCC624                            SerLeuGlyAspValValThrGlyThrArgArgProThrProProAla                               195200205                                                                      AGTGGTTACAGACATGATGCAGAAAGCTTTATTTGACTTCTTGAAGCATCGGT677                       SerGlyTyrArgHisAspAlaGluSerPheIle                                              210215220                                                                      TTGATGGCAGGATCTCCATCACTCGTGTGACGGCAGATATTTCCCTGGCTAAGCGCTCAG737                TTCTCAACAACCCCAGCAAACACATCATCATTGAGCGCTCCAACACACGCTCCAGCCTGG797                CTGAGGTGCAGAGTGAAATCGAGCGAATCTTCGAGCTGGCCCGGACCCTTCAGTTGGTCG857                CTCTGGATGCTGACACCATCAATCACCCAGCCCAGCTGTCCAAGACCTCGCTGGCCCCCA917                TCATTGTTTACATCAAGATCACCTCTCCCAAGGTACTTCAAAGGCTCATCAAGTCCCGAG977                GAAAGTCTCAGTCCAAACACCTCAATGTCCAAATAGCGGCCTCGGAAAAGCTGGCACAGT1037               GCCCCCCTGAAATGTTTGACATCATCCTGGATGAGAACCAATTGGAGGATGCCTGCGAGC1097               ATCTGGCGGAGTACTTGGAAGCCTATTGGAAGGCCACACACCCGCCCAGCAGCACGCCAC1157               CCAATCCGCTGCTGAACCGCACCATGGCTACCGCAGCCCTGGCTGCCAGCCCTGCCCCTG1217               TCTCCAACCTCCAGGTACAGGTGCTCACCTCGCTCAGGAGAAACCTCGGCTTCTGGGGCG1277               GGCTGGAGTCCTCACAGCGGGGCAGTGTGGTGCCCCAGGAGCAGGAACATGCCATGTAGT1337               GGGCGCCCTGCCCGTCTTCCCTCCTGCTCTGGGGTCGGAACTGGAGTGCAGGGAAC1393                   (2) INFORMATION FOR SEQ ID NO: 29:                                             (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 7635 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 511..6996                                                        (ix) FEATURE:                                                                  (A) NAME/KEY: 5'UTR                                                            (B) LOCATION: 1..510                                                           (ix) FEATURE:                                                                  (A) NAME/KEY: 3'UTR                                                            (B) LOCATION: 6994..7635                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:                                       GGGCGAGCGCCTCCGTCCCCGGATGTGAGCTCCGGCTGCCCGCGGTCCCGAGCCAGCGGC60                 GCGCGGGCGGCGGCGGCGGGCACCGGGCACCGCGGCGGGCGGGCAGACGGGCGGGCATGG120                GGGGAGCGCCGAGCGGCCCCGGCGGCCGGGCCGGCATCACCGCGGCGTCTCTCCGCTAGA180                GGAGGGGACAAGCCAGTTCTCCTTTGCAGCAAAAAATTACATGTATATATTATTAAGATA240                ATATATACATTGGATTTTATTTTTTTAAAAAGTTTATTTTGCTCCATTTTTGAAAAAGAG300                AGAGCTTGGGTGGCGAGCGGTTTTTTTTTAAAATCAATTATCCTTATTTTCTGTTATTTG360                TCCCCGTCCCTCCCCACCCCCCTGCTGAAGCGAGAATAAGGGCAGGGACCGCGGCTCCTA420                CCTCTTGGTGATCCCCTTCCCCATTCCGCCCCCGCCCCAACGCCCAGCACAGTGCCCTGC480                ACACAGTAGTCGCTCAATAAATGTTCGTGGATGATGATGATGATGATGATGAAA534                      MetMetMetMetMetMetMetLys                                                       15                                                                             AAAATGCAGCATCAACGGCAGCAGCAAGCGGACCACGCGAACGAGGCA582                            LysMetGlnHisGlnArgGlnGlnGlnAlaAspHisAlaAsnGluAla                               101520                                                                         AACTATGCAAGAGGCACCAGACTTCCTCTTTCTGGTGAAGGACCAACT630                            AsnTyrAlaArgGlyThrArgLeuProLeuSerGlyGluGlyProThr                               25303540                                                                       TCTCAGCCGAATAGCTCCAAGCAAACTGTCCTGTCTTGGCAAGCTGCA678                            SerGlnProAsnSerSerLysGlnThrValLeuSerTrpGlnAlaAla                               455055                                                                         ATCGATGCTGCTAGACAGGCCAAGGCTGCCCAAACTATGAGCACCTCT726                            IleAspAlaAlaArgGlnAlaLysAlaAlaGlnThrMetSerThrSer                               606570                                                                         GCACCCCCACCTGTAGGATCTCTCTCCCAAAGAAAACGTCAGCAATAC774                            AlaProProProValGlySerLeuSerGlnArgLysArgGlnGlnTyr                               758085                                                                         GCCAAGAGCAAAAAACAGGGTAACTCGTCCAACAGCCGACCTGCCCGC822                            AlaLysSerLysLysGlnGlyAsnSerSerAsnSerArgProAlaArg                               9095100                                                                        GCCCTTTTCTGTTTATCACTCAATAACCCCATCCGAAGAGCCTGCATT870                            AlaLeuPheCysLeuSerLeuAsnAsnProIleArgArgAlaCysIle                               105110115120                                                                   AGTATAGTGGAATGGAAACCATTTGACATATTTATATTATTGGCTATT918                            SerIleValGluTrpLysProPheAspIlePheIleLeuLeuAlaIle                               125130135                                                                      TTTGCCAATTGTGTGGCCTTAGCTATTTACATCCCATTCCCTGAAGAT966                            PheAlaAsnCysValAlaLeuAlaIleTyrIleProPheProGluAsp                               140145150                                                                      GATTCTAATTCAACAAATCATAACTTGGAAAAAGTAGAATATGCCTTC1014                           AspSerAsnSerThrAsnHisAsnLeuGluLysValGluTyrAlaPhe                               155160165                                                                      CTGATTATTTTTACAGTCGAGACATTTTTGAAGATTATAGCGTATGGA1062                           LeuIleIlePheThrValGluThrPheLeuLysIleIleAlaTyrGly                               170175180                                                                      TTATTGCTACATCCTAATGCTTATGTTAGGAATGGATGGAATTTACTG1110                           LeuLeuLeuHisProAsnAlaTyrValArgAsnGlyTrpAsnLeuLeu                               185190195200                                                                   GATTTTGTTATAGTAATAGTAGGATTGTTTAGTGTAATTTTGGAACAA1158                           AspPheValIleValIleValGlyLeuPheSerValIleLeuGluGln                               205210215                                                                      TTAACCAAAGAAACAGAAGGCGGGAACCACTCAAGCGGCAAATCTGGA1206                           LeuThrLysGluThrGluGlyGlyAsnHisSerSerGlyLysSerGly                               220225230                                                                      GGCTTTGATGTCAAAGCCCTCCGTGCCTTTCGAGTGTTGCGACCACTT1254                           GlyPheAspValLysAlaLeuArgAlaPheArgValLeuArgProLeu                               235240245                                                                      CGACTAGTGTCAGGAGTGCCCAGTTTACAAGTTGTCCTGAACTCCATT1302                           ArgLeuValSerGlyValProSerLeuGlnValValLeuAsnSerIle                               250255260                                                                      ATAAAAGCCATGGTTCCCCTCCTTCACATAGCCCTTTTGGTATTATTT1350                           IleLysAlaMetValProLeuLeuHisIleAlaLeuLeuValLeuPhe                               265270275280                                                                   GTAATCATAATCTATGCTATTATAGGATTGGAACTTTTTATTGGAAAA1398                           ValIleIleIleTyrAlaIleIleGlyLeuGluLeuPheIleGlyLys                               285290295                                                                      ATGCACAAAACATGTTTTTTTGCTGACTCAGATATCGTAGCTGAAGAG1446                           MetHisLysThrCysPhePheAlaAspSerAspIleValAlaGluGlu                               300305310                                                                      GACCCAGCTCCATGTGCGTTCTCAGGGAATGGACGCCAGTGTACTGCC1494                           AspProAlaProCysAlaPheSerGlyAsnGlyArgGlnCysThrAla                               315320325                                                                      AATGGCACGGAATGTAGGAGTGGCTGGGTTGGCCCGAACGGAGGCATC1542                           AsnGlyThrGluCysArgSerGlyTrpValGlyProAsnGlyGlyIle                               330335340                                                                      ACCAACTTTGATAACTTTGCCTTTGCCATGCTTACTGTGTTTCAGTGC1590                           ThrAsnPheAspAsnPheAlaPheAlaMetLeuThrValPheGlnCys                               345350355360                                                                   ATCACCATGGAGGGCTGGACAGACGTGCTCTACTGGGTAAATGATGCG1638                           IleThrMetGluGlyTrpThrAspValLeuTyrTrpValAsnAspAla                               365370375                                                                      ATAGGATGGGAATGGCCATGGGTGTATTTTGTTAGTCTGATCATCCTT1686                           IleGlyTrpGluTrpProTrpValTyrPheValSerLeuIleIleLeu                               380385390                                                                      GGCTCATTTTTCGTCCTTAACCTGGTTCTTGGTGTCCTTAGTGGAGAA1734                           GlySerPhePheValLeuAsnLeuValLeuGlyValLeuSerGlyGlu                               395400405                                                                      TTCTCAAAGGAAAGAGAGAAGGCAAAAGCACGGGGAGATTTCCAGAAG1782                           PheSerLysGluArgGluLysAlaLysAlaArgGlyAspPheGlnLys                               410415420                                                                      CTCCGGGAGAAGCAGCAGCTGGAGGAGGATCTAAAGGGCTACTTGGAT1830                           LeuArgGluLysGlnGlnLeuGluGluAspLeuLysGlyTyrLeuAsp                               425430435440                                                                   TGGATCACCCAAGCTGAGGACATCGATCCGGAGAATGAGGAAGAAGGA1878                           TrpIleThrGlnAlaGluAspIleAspProGluAsnGluGluGluGly                               445450455                                                                      GGAGAGGAAGGCAAACGAAATACTAGCATGCCCACCAGCGAGACTGAG1926                           GlyGluGluGlyLysArgAsnThrSerMetProThrSerGluThrGlu                               460465470                                                                      TCTGTGAACACAGAGAACGTCAGCGGTGAAGGCGAGAACCGAGGCTGC1974                           SerValAsnThrGluAsnValSerGlyGluGlyGluAsnArgGlyCys                               475480485                                                                      TGTGGAAGTCTCTGTCAAGCCATCTCAAAATCCAAACTCAGCCGACGC2022                           CysGlySerLeuCysGlnAlaIleSerLysSerLysLeuSerArgArg                               490495500                                                                      TGGCGTCGCTGGAACCGATTCAATCGCAGAAGATGTAGGGCCGCCGTG2070                           TrpArgArgTrpAsnArgPheAsnArgArgArgCysArgAlaAlaVal                               505510515520                                                                   AAGTCTGTCACGTTTTACTGGCTGGTTATCGTCCTGGTGTTTCTGAAC2118                           LysSerValThrPheTyrTrpLeuValIleValLeuValPheLeuAsn                               525530535                                                                      ACCTTAACCATTTCCTCTGAGCACTACAATCAGCCAGATTGGTTGACA2166                           ThrLeuThrIleSerSerGluHisTyrAsnGlnProAspTrpLeuThr                               540545550                                                                      CAGATTCAAGATATTGCCAACAAAGTCCTCTTGGCTCTGTTCACCTGC2214                           GlnIleGlnAspIleAlaAsnLysValLeuLeuAlaLeuPheThrCys                               555560565                                                                      GAGATGCTGGTAAAAATGTACAGCTTGGGCCTCCAAGCATATTTCGTC2262                           GluMetLeuValLysMetTyrSerLeuGlyLeuGlnAlaTyrPheVal                               570575580                                                                      TCTCTTTTCAACCGGTTTGATTGCTTCGTGGTGTGTGGTGGAATCACT2310                           SerLeuPheAsnArgPheAspCysPheValValCysGlyGlyIleThr                               585590595600                                                                   GAGACGATCTTGGTGGAACTGGAAATCATGTCTCCCCTGGGGATCTCT2358                           GluThrIleLeuValGluLeuGluIleMetSerProLeuGlyIleSer                               605610615                                                                      GTGTTTCGGTGTGTGCGCCTCTTAAGAATCTTCAAAGTGACCAGGCAC2406                           ValPheArgCysValArgLeuLeuArgIlePheLysValThrArgHis                               620625630                                                                      TGGACTTCCCTGAGCAACTTAGTGGCATCCTTATTAAACTCCATGAAG2454                           TrpThrSerLeuSerAsnLeuValAlaSerLeuLeuAsnSerMetLys                               635640645                                                                      TCCATCGCTTCGCTGTTGCTTCTGCTTTTTCTCTTCATTATCATCTTT2502                           SerIleAlaSerLeuLeuLeuLeuLeuPheLeuPheIleIleIlePhe                               650655660                                                                      TCCTTGCTTGGGATGCAGCTGTTTGGCGGCAAGTTTAATTTTGATGAA2550                           SerLeuLeuGlyMetGlnLeuPheGlyGlyLysPheAsnPheAspGlu                               665670675680                                                                   ACGCAAACCAAGCGGAGCACCTTTGACAATTTCCCTCAAGCACTTCTC2598                           ThrGlnThrLysArgSerThrPheAspAsnPheProGlnAlaLeuLeu                               685690695                                                                      ACAGTGTTCCAGATCCTGACAGGCGAAGACTGGAATGCTGTGATGTAC2646                           ThrValPheGlnIleLeuThrGlyGluAspTrpAsnAlaValMetTyr                               700705710                                                                      GATGGCATCATGGCTTACGGGGGCCCATCCTCTTCAGGAATGATCGTC2694                           AspGlyIleMetAlaTyrGlyGlyProSerSerSerGlyMetIleVal                               715720725                                                                      TGCATCTACTTCATCATCCTCTTCATTTGTGGTAACTATATTCTACTG2742                           CysIleTyrPheIleIleLeuPheIleCysGlyAsnTyrIleLeuLeu                               730735740                                                                      AATGTCTTCTTGGCCATCGCTGTAGACAATTTGGCTGATGCTGAAAGT2790                           AsnValPheLeuAlaIleAlaValAspAsnLeuAlaAspAlaGluSer                               745750755760                                                                   CTGAACACTGCTCAGAAAGAAGAAGCGGAAGAAAAGGAGAGGAAAAAG2838                           LeuAsnThrAlaGlnLysGluGluAlaGluGluLysGluArgLysLys                               765770775                                                                      ATTGCCAGAAAAGAGAGCCTAGAAAATAAAAAGAACAACAAACCAGAA2886                           IleAlaArgLysGluSerLeuGluAsnLysLysAsnAsnLysProGlu                               780785790                                                                      GTCAACCAGATAGCCAACAGTGACAACAAGGTTACAATTGATGACTAT2934                           ValAsnGlnIleAlaAsnSerAspAsnLysValThrIleAspAspTyr                               795800805                                                                      AGAGAAGAGGATGAAGACAAGGACCCCTATCCGCCTTGCGATGTGCCA2982                           ArgGluGluAspGluAspLysAspProTyrProProCysAspValPro                               810815820                                                                      GTAGGGGAAGAGGAAGAGGAAGAGGAGGAGGATGAACCTGAGGTTCCT3030                           ValGlyGluGluGluGluGluGluGluGluAspGluProGluValPro                               825830835840                                                                   GCCGGACCCCGTCCTCGAAGGATCTCGGAGTTGAACATGAAGGAAAAA3078                           AlaGlyProArgProArgArgIleSerGluLeuAsnMetLysGluLys                               845850855                                                                      ATTGCCCCCATCCCTGAAGGGAGCGCTTTCTTCATTCTTAGCAAGACC3126                           IleAlaProIleProGluGlySerAlaPhePheIleLeuSerLysThr                               860865870                                                                      AACCCGATCCGCGTAGGCTGCCACAAGCTCATCAACCACCACATCTTC3174                           AsnProIleArgValGlyCysHisLysLeuIleAsnHisHisIlePhe                               875880885                                                                      ACCAACCTCATCCTTGTCTTCATCATGCTGAGCAGTGCTGCCCTGGCC3222                           ThrAsnLeuIleLeuValPheIleMetLeuSerSerAlaAlaLeuAla                               890895900                                                                      GCAGAGGACCCCATCCGCAGCCACTCCTTCCGGAACACGATACTGGGT3270                           AlaGluAspProIleArgSerHisSerPheArgAsnThrIleLeuGly                               905910915920                                                                   TACTTTGACTATGCCTTCACAGCCATCTTTACTGTTGAGATCCTGTTG3318                           TyrPheAspTyrAlaPheThrAlaIlePheThrValGluIleLeuLeu                               925930935                                                                      AAGATGACAACTTTTGGAGCTTTCCTCCACAAAGGGGCCTTCTGCAGG3366                           LysMetThrThrPheGlyAlaPheLeuHisLysGlyAlaPheCysArg                               940945950                                                                      AACTACTTCAATTTGCTGGATATGCTGGTGGTTGGGGTGTCTCTGGTG3414                           AsnTyrPheAsnLeuLeuAspMetLeuValValGlyValSerLeuVal                               955960965                                                                      TCATTTGGGATTCAATCCAGTGCCATCTCCGTTGTGAAGATTCTGAGG3462                           SerPheGlyIleGlnSerSerAlaIleSerValValLysIleLeuArg                               970975980                                                                      GTCTTAAGGGTCCTGCGTCCCCTCAGGGCCATCAACAGAGCAAAAGGA3510                           ValLeuArgValLeuArgProLeuArgAlaIleAsnArgAlaLysGly                               9859909951000                                                                  CTTAAGCACGTGGTCCAGTGCGTCTTCGTGGCCATCCGGACCATCGGC3558                           LeuLysHisValValGlnCysValPheValAlaIleArgThrIleGly                               100510101015                                                                   AACATCATGATCGTCACCACCCTCCTGCAGTTCATGTTTGCCTGTATC3606                           AsnIleMetIleValThrThrLeuLeuGlnPheMetPheAlaCysIle                               102010251030                                                                   GGGGTCCAGTTGTTCAAGGGGAAGTTCTATCGCTGTACGGATGAAGCC3654                           GlyValGlnLeuPheLysGlyLysPheTyrArgCysThrAspGluAla                               103510401045                                                                   AAAAGTAACCCTGAAGAATGCAGGGGACTTTTCATCCTCTACAAGGAT3702                           LysSerAsnProGluGluCysArgGlyLeuPheIleLeuTyrLysAsp                               105010551060                                                                   GGGGATGTTGACAGTCCTGTGGTCCGTGAACGGATCTGGCAAAACAGT3750                           GlyAspValAspSerProValValArgGluArgIleTrpGlnAsnSer                               1065107010751080                                                               GATTTCAACTTCGACAACGTCCTCTCTGCTATGATGGCGCTCTTCACA3798                           AspPheAsnPheAspAsnValLeuSerAlaMetMetAlaLeuPheThr                               108510901095                                                                   GTCTCCACGTTTGAGGGCTGGCCTGCGTTGCTGTATAAAGCCATCGAC3846                           ValSerThrPheGluGlyTrpProAlaLeuLeuTyrLysAlaIleAsp                               110011051110                                                                   TCGAATGGAGAGAACATCGGCCCAATCTACAACCACCGCGTGGAGATC3894                           SerAsnGlyGluAsnIleGlyProIleTyrAsnHisArgValGluIle                               111511201125                                                                   TCCATCTTCTTCATCATCTACATCATCATTGTAGCTTTCTTCATGATG3942                           SerIlePhePheIleIleTyrIleIleIleValAlaPhePheMetMet                               113011351140                                                                   AACATCTTTGTGGGCTTTGTCATCGTTACATTTCAGGAACAAGGAGAA3990                           AsnIlePheValGlyPheValIleValThrPheGlnGluGlnGlyGlu                               1145115011551160                                                               AAAGAGTATAAGAACTGTGAGCTGGACAAAAATCAGCGTCAGTGTGTT4038                           LysGluTyrLysAsnCysGluLeuAspLysAsnGlnArgGlnCysVal                               116511701175                                                                   GAATACGCCTTGAAAGCACGTCCCTTGCGGAGATACATCCCCAAAAAC4086                           GluTyrAlaLeuLysAlaArgProLeuArgArgTyrIleProLysAsn                               118011851190                                                                   CCCTACCAGTACAAGTTCTGGTACGTGGTGAACTCTTCGCCTTTCGAA4134                           ProTyrGlnTyrLysPheTrpTyrValValAsnSerSerProPheGlu                               119512001205                                                                   TACATGATGTTTGTCCTCATCATGCTCAACACACTCTGCTTGGCCATG4182                           TyrMetMetPheValLeuIleMetLeuAsnThrLeuCysLeuAlaMet                               121012151220                                                                   CAGCACTACGAGCAGTCCAAGATGTTCAATGATGCCATGGACATTCTG4230                           GlnHisTyrGluGlnSerLysMetPheAsnAspAlaMetAspIleLeu                               1225123012351240                                                               AACATGGTCTTCACCGGGGTGTTCACCGTCGAGATGGTTTTGAAAGTC4278                           AsnMetValPheThrGlyValPheThrValGluMetValLeuLysVal                               124512501255                                                                   ATCGCATTTAAGCCTAAGGGGTATTTTAGTGACGCCTGGAACACGTTT4326                           IleAlaPheLysProLysGlyTyrPheSerAspAlaTrpAsnThrPhe                               126012651270                                                                   GACTCCCTCATCGTAATCGGCAGCATTATAGACGTGGCCCTCAGCGAA4374                           AspSerLeuIleValIleGlySerIleIleAspValAlaLeuSerGlu                               127512801285                                                                   GCAGACCCAACTGAAAGTGAAAATGTCCCTGTCCCAACTGCTACACCT4422                           AlaAspProThrGluSerGluAsnValProValProThrAlaThrPro                               129012951300                                                                   GGGAACTCTGAAGAGAGCAATAGAATCTCCATCACCTTTTTCCGTCTT4470                           GlyAsnSerGluGluSerAsnArgIleSerIleThrPhePheArgLeu                               1305131013151320                                                               TTCCGAGTGATGCGATTGGTGAAGCTTCTCAGCAGGGGGGAAGGCATC4518                           PheArgValMetArgLeuValLysLeuLeuSerArgGlyGluGlyIle                               132513301335                                                                   CGGACATTGCTGTGGACTTTTATTAAGTTCTTTCAGGCGCTCCCGTAT4566                           ArgThrLeuLeuTrpThrPheIleLysPhePheGlnAlaLeuProTyr                               134013451350                                                                   GTGGCCCTCCTCATAGCCATGCTGTTCTTCATCTATGCGGTCATTGGC4614                           ValAlaLeuLeuIleAlaMetLeuPhePheIleTyrAlaValIleGly                               135513601365                                                                   ATGCAGATGTTTGGGAAAGTTGCCATGAGAGATAACAACCAGATCAAT4662                           MetGlnMetPheGlyLysValAlaMetArgAspAsnAsnGlnIleAsn                               137013751380                                                                   AGGAACAATAACTTCCAGACGTTTCCCCAGGCGGTGCTGCTGCTCTTC4710                           ArgAsnAsnAsnPheGlnThrPheProGlnAlaValLeuLeuLeuPhe                               1385139013951400                                                               AGGTGTGCAACAGGTGAGGCCTGGCAGGAGATCATGCTGGCCTGTCTC4758                           ArgCysAlaThrGlyGluAlaTrpGlnGluIleMetLeuAlaCysLeu                               140514101415                                                                   CCAGGGAAGCTCTGTGACCCTGAGTCAGATTACAACCCCGGGGAGGAG4806                           ProGlyLysLeuCysAspProGluSerAspTyrAsnProGlyGluGlu                               142014251430                                                                   CATACATGTGGGAGCAACTTTGCCATTGTCTATTTCATCAGTTTTTAC4854                           HisThrCysGlySerAsnPheAlaIleValTyrPheIleSerPheTyr                               143514401445                                                                   ATGCTCTGTGCATTTCTGATCATCAATCTGTTTGTGGCTGTCATCATG4902                           MetLeuCysAlaPheLeuIleIleAsnLeuPheValAlaValIleMet                               145014551460                                                                   GATAATTTCGACTATCTGACCCGGGACTGGTCTATTTTGGGGCCTCAC4950                           AspAsnPheAspTyrLeuThrArgAspTrpSerIleLeuGlyProHis                               1465147014751480                                                               CATTTAGATGAATTCAAAAGAATATGGTCAGAATATGACCCTGAGGCA4998                           HisLeuAspGluPheLysArgIleTrpSerGluTyrAspProGluAla                               148514901495                                                                   AAGGGAAGGATAAAACACCTTGATGTGGTCACTCTGCTTCGACGCATC5046                           LysGlyArgIleLysHisLeuAspValValThrLeuLeuArgArgIle                               150015051510                                                                   CAGCCTCCCCTGGGGTTTGGGAAGTTATGTCCACACAGGGTAGCGTGC5094                           GlnProProLeuGlyPheGlyLysLeuCysProHisArgValAlaCys                               151515201525                                                                   AAGAGATTAGTTGCCATGAACATGCCTCTCAACAGTGACGGGACAGTC5142                           LysArgLeuValAlaMetAsnMetProLeuAsnSerAspGlyThrVal                               153015351540                                                                   ATGTTTAATGCAACCCTGTTTGCTTTGGTTCGAACGGCTCTTAAGATC5190                           MetPheAsnAlaThrLeuPheAlaLeuValArgThrAlaLeuLysIle                               1545155015551560                                                               AAGACCGAAGGGAACCTGGAGCAAGCTAATGAAGAACTTCGGGCTGTG5238                           LysThrGluGlyAsnLeuGluGlnAlaAsnGluGluLeuArgAlaVal                               156515701575                                                                   ATAAAGAAAATTTGGAAGAAAACCAGCATGAAATTACTTGACCAAGTT5286                           IleLysLysIleTrpLysLysThrSerMetLysLeuLeuAspGlnVal                               158015851590                                                                   GTCCCTCCAGCTGGTGATGATGAGGTAACCGTGGGGAAGTTCTATGCC5334                           ValProProAlaGlyAspAspGluValThrValGlyLysPheTyrAla                               159516001605                                                                   ACTTTCCTGATACAGGACTACTTTAGGAAATTCAAGAAACGGAAAGAA5382                           ThrPheLeuIleGlnAspTyrPheArgLysPheLysLysArgLysGlu                               161016151620                                                                   CAAGGACTGGTGGGAAAGTACCCTGCGAAGAACACCACAATTGCCCTA5430                           GlnGlyLeuValGlyLysTyrProAlaLysAsnThrThrIleAlaLeu                               1625163016351640                                                               CAGGCGGGATTAAGGACACTGCATGACATTGGGCCAGAAATCCGGCGT5478                           GlnAlaGlyLeuArgThrLeuHisAspIleGlyProGluIleArgArg                               164516501655                                                                   GCTATATCGTGTGATTTGCAAGATGACGAGCCTGAGGAAACAAAACGA5526                           AlaIleSerCysAspLeuGlnAspAspGluProGluGluThrLysArg                               166016651670                                                                   GAAGAAGAAGATGATGTGTTCAAAAGAAATGGTGCCCTGCTTGGAAAC5574                           GluGluGluAspAspValPheLysArgAsnGlyAlaLeuLeuGlyAsn                               167516801685                                                                   CATGTCAATCATGTTAATAGTGATAGGAGAGATTCCCTTCAGCAGACC5622                           HisValAsnHisValAsnSerAspArgArgAspSerLeuGlnGlnThr                               169016951700                                                                   AATACCACCCACCGTCCCCTGCATGTCCAAAGGCCTTCAATTCCACCT5670                           AsnThrThrHisArgProLeuHisValGlnArgProSerIleProPro                               1705171017151720                                                               GCAAGTGATACTGAGAAACCGCTGTTTCCTCCAGCAGGAAATTCGGTG5718                           AlaSerAspThrGluLysProLeuPheProProAlaGlyAsnSerVal                               172517301735                                                                   TGTCATAACCATCATAACCATAATTCCATAGGAAAGCAAGTTCCCACC5766                           CysHisAsnHisHisAsnHisAsnSerIleGlyLysGlnValProThr                               174017451750                                                                   TCAACAAATGCCAATCTCAATAATGCCAATATGTCCAAAGCTGCCCAT5814                           SerThrAsnAlaAsnLeuAsnAsnAlaAsnMetSerLysAlaAlaHis                               175517601765                                                                   GGAAAGCGGCCCAGCATTGGGAACCTTGAGCATGTGTCTGAAAATGGG5862                           GlyLysArgProSerIleGlyAsnLeuGluHisValSerGluAsnGly                               177017751780                                                                   CATCATTCTTCCCACAAGCATGACCGGGAGCCTCAGAGAAGGTCCAGT5910                           HisHisSerSerHisLysHisAspArgGluProGlnArgArgSerSer                               1785179017951800                                                               GTGAAAAGAACCCGCTATTATGAAACTTACATTAGGTCCGACTCAGGA5958                           ValLysArgThrArgTyrTyrGluThrTyrIleArgSerAspSerGly                               180518101815                                                                   GATGAACAGCTCCCAACTATTTGCCGGGAAGACCCAGAGATACATGGC6006                           AspGluGlnLeuProThrIleCysArgGluAspProGluIleHisGly                               182018251830                                                                   TATTTCAGGGACCCCCACTGCTTGGGGGAGCAGGAGTATTTCAGTAGT6054                           TyrPheArgAspProHisCysLeuGlyGluGlnGluTyrPheSerSer                               183518401845                                                                   GAGGAATGCTACGAGGATGACAGCTCGCCCACCTGGAGCAGGCAAAAC6102                           GluGluCysTyrGluAspAspSerSerProThrTrpSerArgGlnAsn                               185018551860                                                                   TATGGCTACTACAGCAGATACCCAGGCAGAAACATCGACTCTGAGAGG6150                           TyrGlyTyrTyrSerArgTyrProGlyArgAsnIleAspSerGluArg                               1865187018751880                                                               CCCCGAGGCTACCATCATCCCCAAGGATTCTTGGAGGACGATGACTCG6198                           ProArgGlyTyrHisHisProGlnGlyPheLeuGluAspAspAspSer                               188518901895                                                                   CCCGTTTGCTATGATTCACGGAGATCTCCAAGGAGACGCCTACTACCT6246                           ProValCysTyrAspSerArgArgSerProArgArgArgLeuLeuPro                               190019051910                                                                   CCCACCCCAGCATCCCACCGGAGATCCTCCTTCAACTTTGAGTGCCTG6294                           ProThrProAlaSerHisArgArgSerSerPheAsnPheGluCysLeu                               191519201925                                                                   CGCCGGCAGAGCAGCCAGGAAGAGGTCCCGTCGTCTCCCATCTTCCCC6342                           ArgArgGlnSerSerGlnGluGluValProSerSerProIlePhePro                               193019351940                                                                   CATCGCACGGCCCTGCCTCTGCATCTAATGCAGCAACAGATCATGGCA6390                           HisArgThrAlaLeuProLeuHisLeuMetGlnGlnGlnIleMetAla                               1945195019551960                                                               GTTGCCGGCCTAGATTCAAGTAAAGCCCAGAAGTACTCACCGAGTCAC6438                           ValAlaGlyLeuAspSerSerLysAlaGlnLysTyrSerProSerHis                               196519701975                                                                   TCGACCCGGTCGTGGGCCACCCCTCCAGCAACCCCTCCCTACCGGGAC6486                           SerThrArgSerTrpAlaThrProProAlaThrProProTyrArgAsp                               198019851990                                                                   TGGACACCGTGCTACACCCCCCTGATCCAAGTGGAGCAGTCAGAGGCC6534                           TrpThrProCysTyrThrProLeuIleGlnValGluGlnSerGluAla                               199520002005                                                                   CTGGACCAGGTGAACGGCAGCCTGCCGTCCCTGCACCGCAGCTCCTGG6582                           LeuAspGlnValAsnGlySerLeuProSerLeuHisArgSerSerTrp                               201020152020                                                                   TACACAGACGAGCCCGACATCTCCTACCGGACTTTCACACCAGCCAGC6630                           TyrThrAspGluProAspIleSerTyrArgThrPheThrProAlaSer                               2025203020352040                                                               CTGACTGTCCCCAGCAGCTTCCGGAACAAAAACAGCGACAAGCAGAGG6678                           LeuThrValProSerSerPheArgAsnLysAsnSerAspLysGlnArg                               204520502055                                                                   AGTGCGGACAGCTTGGTGGAGGCAGTCCTGATATCCGAAGGCTTGGGA6726                           SerAlaAspSerLeuValGluAlaValLeuIleSerGluGlyLeuGly                               206020652070                                                                   CGCTATGCAAGGGACCCAAAATTTGTGTCAGCAACAAAACACGAAATC6774                           ArgTyrAlaArgAspProLysPheValSerAlaThrLysHisGluIle                               207520802085                                                                   GCTGATGCCTGTGACCTCACCATCGACGAGATGGAGAGTGCAGCCAGC6822                           AlaAspAlaCysAspLeuThrIleAspGluMetGluSerAlaAlaSer                               209020952100                                                                   ACCCTGCTTAATGGGAACGTGCGTCCCCGAGCCAACGGGGATGTGGGC6870                           ThrLeuLeuAsnGlyAsnValArgProArgAlaAsnGlyAspValGly                               2105211021152120                                                               CCCCTCTCACACCGGCAGGACTATGAGCTACAGGACTTTGGTCCTGGC6918                           ProLeuSerHisArgGlnAspTyrGluLeuGlnAspPheGlyProGly                               212521302135                                                                   TACAGCGACGAAGAGCCAGACCCTGGGAGGGATGAGGAGGACCTGGCG6966                           TyrSerAspGluGluProAspProGlyArgAspGluGluAspLeuAla                               214021452150                                                                   GATGAAATGATATGCATCACCACCTTGTAGCCCCCAGCGAGGGGCAG7013                            AspGluMetIleCysIleThrThrLeu                                                    21552160                                                                       ACTGGCTCTGGCCTCAGGTGGGGCGCAGGAGAGCCAGGGGAAAAGTGCCTCATAGTTAGG7073               AAAGTTTAGGCACTAGTTGGGAGTAATATTCAATTAATTAGACTTTTGTATAAGAGATGT7133               CATGCCTCAAGAAAGCCATAAACCTGGTAGGAACAGGTCCCAAGCGGTTGAGCCTGGCAG7193               AGTACCATGCGCTCGGCCCCAGCTGCAGGAAACAGCAGGCCCCGCCCTCTCACAGAGGAT7253               GGGTGAGGAGGCCAGACCTGCCCTGCCCCATTGTCCAGATGGGCACTGCTGTGGAGTCTG7313               CTTCTCCCATGTACCAGGGCACCAGGCCCACCCAACTGAAGGCATGGCGGCGGGGTGCAG7373               GGGAAAGTTAAAGGTGATGACGATCATCACACCTGTGTCGTTACCTCAGCCATCGGTCTA7433               GCATATCAGTCACTGGGCCCAACATATCCATTTTTAAACCCTTTCCCCCAAATACACTGC7493               GTCCTGGTTCCTGTTTAGCTGTTCTGAAATACGGTGTGTAAGTAAGTCAGAACCCAGCTA7553               CCAGTGATTATTGCGAGGGCAATGGGACCTCATAAATAAGGTTTTCTGTGATGTGACGCC7613               AGTTTACATAAGAGAATATCAC7635                                                     __________________________________________________________________________ 

What is claimed is:
 1. A method for identifying a compound that modulates the activity of a calcium channel, comprising;suspending a eukaryotic cell which has a functional, heterologous calcium channel, in a solution containing a test compound and a calcium channel selective ion; depolarizing the cell membrane of said cell; detecting the current flowing into said cell; and comparing the current flowing into the cell with the current that flows into the same or a substantially identical cell suspended in the presence of the same calcium channel selective ion but in the absence of the compound, whereby test compounds that modulate the activity of a calcium channel are identified, wherein: the heterologous calcium channel includes one or more human calcium channel subunits expressed by nucleic acid that is heterologous to said cell; the only heterologous ion channels that are expressed by the cells are calcium channels; the current that is detected in the presence of a compound that modulates the activity of the heterologous calcium channel is different from that produced when the same or a substantially identical cell in the presence of the same calcium channel selective ion but in the absence of said compound is depolarized; the one or more subunits comprise α₁ -subunit selected from the group consisting of:an α_(1C) -subunit comprising the sequence of amino acids encoded by the sequence nucleotides set forth in SEQ ID NO. 3; an a_(1D) subunit comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO: 1 or SEQ ID NO. 29; a human calcium channel α₁ -subunit that is encoded by a DNA molecule capable of hybridizing under conditions of high stringency with DNA that is fully complementary to an mRNA transcript native to a human cell that encodes one of the aforesaid human calcium channel α₁ -subunits, such that any probe that contains at least 14 contiguous bases from the coding portion of the α₁ -subunit-encoding DNA molecule is capable of hybridizing under conditions of high stringency to the DNA that is fully complementary to the mRNA transcript; an α_(1C) -subunit encoded by an mRNA transcript native to a human cell, wherein DNA that is fully complementary to the transcript is capable of hybridizing under conditions of high stringency to a probe having the sequence of nucleotides set forth in SEQ ID No. 4, SEQ ID No. 5 or SEQ ID No. 6; and an α_(1D) -subunit encoded by an mRNA transcript native to a human cell, wherein DNA that is fully complementary to the transcript is capable of hybridizing under conditions of high stringency to a probe having the sequence of nucleotides set forth in SEQ ID No.
 2. 2. The method of claim 1, wherein prior to the depolarization step the cell is maintained at a holding potential which substantially inactivates calcium channels which are endogenous to said cell.
 3. The method of claim 1, wherein:the cell is an amphibian oocyte; the heterologous nucleic acid is RNA that is injected into the oocyte; and the one or more human calcium channel subunits further comprise a β₂ -subunit.
 4. The method of claim 1, wherein:the cell is an amphibian oocyte; the heterologous nucleic acid is RNA that is injected into the oocyte; and the one or more human calcium channel subunits further comprise a β-subunit, wherein: the β-subunit is selected from the group consisting of: a β₁₋₁ -subunit comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO. 26; a β₁₋₂ -subunit comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO. 9; a β₁₋₃ -subunit comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO. 10; a β₁₋₄ -subunit comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO. 27; a β₁₋₅ -subunit comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO. 28; and a human calcium channel β-subunit that is encoded by a DNA molecule capable of hybridizing under conditions of high stringency with DNA that is fully complementary to an mRNA transcript native to a human cell that encodes one of the aforesaid human calcium channel β-subunits.
 5. The method of claim 4, wherein the subunits encoded by said RNA further comprise an α₂ -subunit, a γ-subunit or an a α₂ -subunit and γ-subunit.
 6. The method of claim 1, wherein the cell is an HEK cell and the heterologous nucleic acid is DNA.
 7. A substantially pure α₁ -subunit of a human calcium channel, wherein:the α₁ -subunit is selected from the group consisting of:an α_(1C) -subunit comprising the sequence of amino acids encoded by the sequence nucleotides set forth in SEQ ID NO. 3; an α_(1D) -subunit comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO: 1 or SEQ ID NO. 29; an α_(1A) -subunit comprising the sequence of amino acids encoded by the α_(1A) -encoding DNA in a phage having all of the identifying characteristics of the phage deposited as ATCC Accession No. 75293; a human calcium channel α₁ -subunit that is encoded by a DNA molecule capable of hybridizing under conditions of high stringency with DNA that is fully complementary to an mRNA transcript native to a human cell that encodes one of the aforesaid human calcium channel α₁ -subunits, such that any probe that contains at least 14 contiguous bases from the coding portion of the α₁ -subunit-encoding DNA molecule is capable of hybridizing under conditions of high stringency to the DNA that is fully complementary to the mRNA transcripts; an α_(1A) -subunit encoded by an mRNA transcript native to a human cell, wherein DNA that is fully complementary to the transcript is capable of hybridizing under conditions of high stringency to a probe having the sequence of nucleotides set forth in SEQ ID No. 21; an α_(1C) -subunit encoded by an mRNA transcript native to a human cell, wherein DNA that is fully complementary to the transcript is capable of hybridizing under conditions of high stringency to a probe having the sequence of nucleotides set forth in SEQ ID No. 4, SEQ ID No. 5 or SEQ ID No. 6; and an α_(1D) -Subunit encoded by an mRNA transcript native to a human cell, wherein DNA that is fully complementary to the transcript is capable of hybridizing under conditions of high stringency to a probe having the sequence of nucleotides set forth in SEQ ID No.
 2. 8. The subunit of claim 7 that is an α_(1D) -subunit.
 9. The subunit of claim 8, comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO.
 1. 10. The subunit of claim 8, comprising the sequence of amino acids that is encoded by the sequence of nucleotides set forth in SEQ ID NO.
 2. 11. The subunit of claim 7 that is an α_(1C) subunit.
 12. The subunit of claim 11, comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO.
 3. 13. The subunit of claim 11, comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO.
 4. 14. The subunit of claim 11, comprising the sequence of amino acids that is encoded by the sequence of nucleotides set forth in SEQ ID NO.
 5. 15. The subunit of claim 11, comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO.
 6. 16. The subunit of claim 7 that is an α_(1A) -subunit.
 17. The subunit of claim 16, comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO.
 21. 18. The subunit of claim 16 comprising the sequence of amino acids encoded by a sequence of nucleotides comprising the sequence of nucleotides of the α_(1a) -encoding DNA in a phage having all of the identifying characteristics of the phage deposited under ATCC Accession No.
 75293. 19. A substantially pure α₁ -subunit of a human calcium channel that is encoded by nucleic acid native to a human cell, wherein:(a) the α₁ -subunit is selected from the group consisting of:an α_(1c) -subunit encoded by a DNA molecule comprising the sequence nucleotides set forth in SEQ ID NO. 3, SEQ ID NO. 4, SEQ ID NO. 5 or SEQ ID NO. 6; an α_(1D) subunit comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO: 1 or SEQ ID NO. 29; an α_(1A) -subunit encoded by a DNA molecule comprising the sequence of nucleotides set forth in SEQ ID NO. 21 or encoded by the α_(1A) -encoding DNA in a phage having all of the identifying characteristics of the phage deposited as ATCC Accession No. 75293; anda human calcium channel α₁ -subunit that is encoded by DNA capable of hybridizing under conditions of high stringency with DNA that is fully complementary to the mRNA transcript native to a human cell; and (b) the α₁ -subunit binds to at least one compound selected from the group consisting of a dihydropyridines, a phenylalkylamines, ω-CgTx, and a pyrazonoylguanidines.
 20. A substantially pure α₂ -subunit of a human calcium channel selected from the group consisting of:an α_(2a) -subunit comprising the sequence of amino acids encoded by the sequence nucleotides set forth in SEQ ID NO. 22; an α_(2c) subunit comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO. 23; an α_(2d) subunit comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO. 24; and an α_(2e) subunit comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO.
 25. 21. The subunit of claim 20 that is an α_(2a) -subunit.
 22. The subunit of claim 21, comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO.
 22. 23. The subunit of claim 20 that is an α_(2c) -subunit.
 24. The subunit of claim 23, comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO.
 23. 25. The subunit of claim 20 that is an α_(2d) -subunit.
 26. The subunit of claim 25, comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO.
 24. 27. The subunit of claim 20 that is an α_(2e) -subunit.
 28. The subunit of claim 27 comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO.
 25. 29. An isolated α_(2b) -subunit of a human calcium channel comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO.
 11. 30. A substantially pure β₁ -subunit of a human calcium channel selected from the group consisting of:a β₁₋₁ -subunit comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO. 26; a β₁₋₂ -subunit comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO. 9; a β₁₋₃ -subunit comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO.10; a β₁₋₄ -subunit comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO. 27; a β₁₋₅ -subunit comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO. 28; and a β₁ subunit encoded by an mRNA transcript native to a human cell, wherein DNA that is fully complementary to the transcript is capable of hybridizing under conditions of high stringency to a probe having a sequence corresponding to at least 30 contiguous nucleotides present in SEQ ID NO:
 26. SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:27 or SEQ NO: 28; anda human calcium channel β₁ -subunit that is encoded by a DNA molecule capable of hybridizing under conditions of high stringency with DNA that is fully complementary to an mRNA transcript native to a human cell that encodes one of the aforesaid human calcium channel β₁ -subunits; such that any probe that contains at least 14 contiguous bases from the coding portion of the β₁ -subunit-encoding DNA molecule is capable of hybridizing under conditions of high stringency to the DNA molecule that is fully complementary to the mRNA transcript.
 31. The subunit of claim 30 that is an β1-1-subunit.
 32. The subunit of claim 31 comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO.
 26. 33. The subunit of claim 30 that is an β₁₋₂ -subunit.
 34. The subunit of claim 33, comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO.
 9. 35. The subunit of claim 30 that is an β₁₋₃ -subunit.
 36. The subunit of claim 35, comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO.
 10. 37. The subunit of claim 30 that is an β₁₋₄ -subunit.
 38. The subunit of claim 37, comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO.
 27. 39. The subunit of claim 30 that is an β₁₋₅ -subunit.
 40. The subunit of claim 39, comprising the sequence of amino acids encoded by the sequence of nucleotides set forth in SEQ ID NO.
 28. 41. A substantially pure peptide, comprising the sequence of amino acids encoded by SEQ ID No. 3 or
 6. 42. A substantially pure peptide, comprising the sequence of amino acids encoded by SEQ ID No.
 2. 